About
NVIDIA's open reward model, used for synthetic data generation and alignment pipelines.
Dimension Scores
intelligence: 96
creativity: 88
accuracy: 94
speed: 72
coding: 98
research: 92
value: 84
safety: 90
▲ Pros
- Open weights
- Strong RLHF signal
- Permissive license
▼ Cons
- Niche use case
- Heavy compute
Community Reviews
Reviews open once this model is indexed in the live database.