Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 13 days ago • 166
view post Post 3121 NEW: @EssentialAI just released Rnj-1, their first 8B model. You can easily fine-tune it with GRPO using TRL to add reasoning capabilities to a compact modeFree Colab link: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_rnj_1_instruct.ipynbMore free TRL notebooks: https://huggingface.co/docs/trl/main/en/example_overview#notebooks See translation 🚀 7 7 + Reply