Reward Models
updated
nvidia/Llama-3.3-Nemotron-70B-Reward-Multilingual
Text Generation
• 71B • Updated
• 28
• 10
nvidia/Llama-3.3-Nemotron-70B-Reward-Principle
Text Generation
• 71B • Updated
• 292
• 6
nvidia/Qwen-3-Nemotron-32B-Reward
Text Classification
• 32B • Updated
• 97
• 19
Skywork/Skywork-Reward-V2-Llama-3.1-8B
Text Classification
• 8B • Updated
• 23.5k
• 41
Text Classification
• 8B • Updated
• 79
• 9
allenai/Llama-3.1-70B-Instruct-RM-RB2
Text Classification
• Updated
• 32
• 1
allenai/Llama-3.1-8B-Instruct-RM-RB2
Text Classification
• Updated
• 207
• 1
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
• Updated
• 18.9k
• 182
nvidia/Llama-3.3-Nemotron-70B-Select
Text Generation
• 71B • Updated
• 29
• 11
nvidia/Llama-3.3-Nemotron-70B-Edit
Text Generation
• 71B • Updated
• 18
• 3
nvidia/Llama-3.3-Nemotron-70B-Feedback
Text Generation
• 71B • Updated
• 18
• 8
allenai/Llama-3.1-Tulu-3-8B-RM
Text Classification
• 8B • Updated
• 58
• 19
Text Classification
• Updated
• 66.9k
• 82
NCSOFT/Llama-3-OffsetBias-RM-8B
Text Classification
• 8B • Updated
• 118
• 24
NCSOFT/Llama-3-OffsetBias-8B
Text Generation
• 8B • Updated
• 29
• 14
nvidia/Qwen2.5-CascadeRL-RM-72B
Text Generation
• 71B • Updated
• 372
• 11
general-preference/GPM-Llama-3.1-8B
8B • Updated
• 174
• 1