Post 3524
We collaborated with NVIDIA to teach you about Reinforcement Learning and RL environments. 💚

Learn:
• Why RL environments matter + how to build them
• When RL is better than SFT
• GRPO and RL best practices
• How verifiable rewards and RLVR work

Blog: https://unsloth.ai/blog/rl-environments
Unsloth Dynamic 2.0 Quants (Collection): New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 80 items • Updated 5 days ago • 468
unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF Text Generation • 121B • Updated 5 days ago • 41.7k • 69