V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts Paper • 2603.10848 • Published 3 days ago • 7
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-Base-BF16 Text Generation • 124B • Updated 3 days ago • 1.41k • 15
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation • 124B • Updated 1 day ago • 8.96k • 108
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated about 12 hours ago • 6.44k • 173
ibm-granite/granite-guardian-3.2-8b-factuality-detection Text Generation • 8B • Updated 11 days ago • 125 • 3
ibm-granite/granite-4.0-1b-speech Automatic Speech Recognition • 2B • Updated about 9 hours ago • 11.3k • 103
nvidia/Nemotron-Research-GooseReason-4B-Instruct Text Generation • 4B • Updated 13 days ago • 256 • 7