ThinkPRM Process Reward Models that Think -- https://arxiv.org/abs/2504.16828 launch/ThinkPRM-1.5B Text Generation • 2B • Updated Jun 25, 2025 • 33 • 3 launch/ThinkPRM-7B Text Generation • 8B • Updated May 17, 2025 • 12 • 1 launch/ThinkPRM-14B Text Generation • 15B • Updated Jul 1, 2025 • 394 • 6 mradermacher/ThinkPRM-7B-i1-GGUF 8B • Updated Jul 11, 2025 • 2.36k
ThinkPRM Process Reward Models that Think -- https://arxiv.org/abs/2504.16828 launch/ThinkPRM-1.5B Text Generation • 2B • Updated Jun 25, 2025 • 33 • 3 launch/ThinkPRM-7B Text Generation • 8B • Updated May 17, 2025 • 12 • 1 launch/ThinkPRM-14B Text Generation • 15B • Updated Jul 1, 2025 • 394 • 6 mradermacher/ThinkPRM-7B-i1-GGUF 8B • Updated Jul 11, 2025 • 2.36k