Active filters: reward
li-jay-cs/test2-rlhf-rm-checkpoint
li-jay-cs/gpt2-medium-rlhf-rm-checkpoint
li-jay-cs/test3-rlhf-rm-checkpoint
li-jay-cs/gpt2-rlhf-rm-checkpoint
Updated • 15
li-jay-cs/gpt2-training-full-rlhf-rm-checkpoint
li-jay-cs/gpt2-last_token_reward_and_full_training-rlhf-rm-checkpoint
li-jay-cs/1gpu-gpt2-myepoch1-gcp-reward-model
Text Classification
• 0.3B • Updated • 11
ZhangNy/2024-11-18_10-58-28
0.2B • Updated • 1
8B • Updated • 280
• 6
33B • Updated • 30
• 8
eth-nlped/Qwen2.5-1.5B-pedagogical-rewardmodel
Text Classification
• 2B • Updated • 1.58k
• 4
NiuTrans/GRAM-Qwen3-1.7B-RewardModel
2B • Updated • 5
• 6
NiuTrans/GRAM-Qwen3-14B-RewardModel
15B • Updated • 6
• 4
NiuTrans/GRAM-LLaMA3.2-3B-RewardModel
3B • Updated • 7
• 3
NiuTrans/GRAM-Qwen3-4B-RewardModel
4B • Updated • 8
• 2
NiuTrans/GRAM-Qwen3-8B-RewardModel
8B • Updated • 5
• 4
prithivMLmods/GRAM-LLaMA3.2-3B-RewardModel-GGUF
Text Ranking
• 3B • Updated • 4
prithivMLmods/GRAM-Qwen3-4B-RewardModel-GGUF
Text Ranking
• 4B • Updated • 4
mradermacher/GRAM-LLaMA3.2-3B-RewardModel-GGUF
3B • Updated • 83
mradermacher/GRAM-LLaMA3.2-3B-RewardModel-i1-GGUF
3B • Updated • 85
TIGER-Lab/EditReward-MiMo-VL-7B-SFT-2508
Image-to-Text
• Updated • 59
• 1
TIGER-Lab/EditReward-Qwen2.5-VL-7B
Image-Text-to-Text
• Updated • 72
• 3
Text Generation
• Updated • 18
• 4
Text Classification
• 8B • Updated • 27
Text Classification
• 2B • Updated • 132
Text Classification
• 4B • Updated • 35
TIGER-Lab/RationalRewards-8B-T2I
Image-to-Text
• 9B • Updated • 226
• 4
TIGER-Lab/RationalRewards-8B-Edit
Image-to-Text
• 9B • Updated • 156
• 3