view post Post 3483 We collaborated with NVIDIA to teach you about Reinforcement Learning and RL environments. 💚 Learn:• Why RL environments matter + how to build them• When RL is better than SFT• GRPO and RL best practices• How verifiable rewards and RLVR workBlog: https://unsloth.ai/blog/rl-environments See translation 4 replies · 🔥 9 9 ❤️ 1 1 🤝 1 1 + Reply
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 80 items • Updated 5 days ago • 468
unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF Text Generation • 121B • Updated 5 days ago • 41.7k • 66
unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF Text Generation • 121B • Updated 5 days ago • 41.7k • 66
unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation • 67B • Updated 5 days ago • 40.1k • 18
unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation • 67B • Updated 5 days ago • 40.1k • 18