Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

29

Base only

Active filters: reward

li-jay-cs/test2-rlhf-rm-checkpoint

Updated Dec 21, 2023 • 3

li-jay-cs/gpt2-medium-rlhf-rm-checkpoint

Updated Dec 25, 2023 • 3

li-jay-cs/test3-rlhf-rm-checkpoint

Updated Dec 24, 2023 • 5

li-jay-cs/gpt2-rlhf-rm-checkpoint

Updated Dec 24, 2023 • 3

li-jay-cs/gpt2-training-full-rlhf-rm-checkpoint

Updated Dec 25, 2023 • 2

li-jay-cs/gpt2-last_token_reward_and_full_training-rlhf-rm-checkpoint

Updated Dec 25, 2023 • 4

li-jay-cs/1gpu-gpt2-myepoch1-gcp-reward-model

Updated Jan 12, 2024 • 4

thobauma/opt-350m

Text Classification • 0.3B • Updated Apr 25, 2024 • 2

ZhangNy/2024-11-18_10-58-28

0.2B • Updated Nov 18, 2024 • 1

TIGER-Lab/AceCodeRM-7B

8B • Updated Apr 9, 2025 • 24 • 6

TIGER-Lab/AceCodeRM-32B

33B • Updated Apr 9, 2025 • 10 • 8

eth-nlped/Qwen2.5-1.5B-pedagogical-rewardmodel

Text Classification • 2B • Updated Nov 18, 2025 • 694 • 4

NiuTrans/GRAM-Qwen3-1.7B-RewardModel

2B • Updated Jun 26, 2025 • 5 • 6

NiuTrans/GRAM-Qwen3-14B-RewardModel

15B • Updated Jun 26, 2025 • 8 • 4

NiuTrans/GRAM-LLaMA3.2-3B-RewardModel

3B • Updated Jun 26, 2025 • 3 • 3

NiuTrans/GRAM-Qwen3-4B-RewardModel

4B • Updated Jun 26, 2025 • 9 • 2

NiuTrans/GRAM-Qwen3-8B-RewardModel

8B • Updated Jun 26, 2025 • 15 • 4

prithivMLmods/GRAM-LLaMA3.2-3B-RewardModel-GGUF

Text Ranking • 3B • Updated Jul 28, 2025 • 164

prithivMLmods/GRAM-Qwen3-4B-RewardModel-GGUF

Text Ranking • 4B • Updated Jul 28, 2025 • 3

mradermacher/GRAM-LLaMA3.2-3B-RewardModel-GGUF

3B • Updated Aug 29, 2025 • 26

mradermacher/GRAM-LLaMA3.2-3B-RewardModel-i1-GGUF

3B • Updated Dec 23, 2025 • 62

TIGER-Lab/EditReward-MiMo-VL-7B-SFT-2508

Image-to-Text • 8B • Updated Dec 23, 2025 • 68 • 1

TIGER-Lab/EditReward-Qwen2.5-VL-7B

Image-Text-to-Text • 8B • Updated Dec 23, 2025 • 52 • 3

newmindai/Muhakim

Text Generation • Updated Jan 27 • 21 • 4

LARK-Lab/CodeScaler-8B

Text Classification • 8B • Updated Feb 23 • 70

LARK-Lab/CodeScaler-1.7B

Text Classification • 2B • Updated Feb 23 • 4

LARK-Lab/CodeScaler-4B

Text Classification • 4B • Updated Feb 23 • 4

TIGER-Lab/RationalRewards-8B-T2I

Image-to-Text • 9B • Updated Apr 14 • 220 • 4

TIGER-Lab/RationalRewards-8B-Edit

Image-to-Text • 9B • Updated Apr 14 • 286 • 3