Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano v3. • 8 items • Updated 8 days ago • 60
view article Article Training LLM Agents to Act Under Adversarial Evidence with Multi-Reward Dual-Control RL 5 days ago • 2
PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models Paper • 2601.11087 • Published 13 days ago • 11
view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 24 days ago • 37
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking Paper • 2601.06487 • Published 19 days ago • 51
CausalARC: Abstract Reasoning with Causal World Models Paper • 2509.03636 • Published Sep 3, 2025 • 1
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 12 items • Updated 8 days ago • 52
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance Apr 16, 2025 • 61
Reinforcement Learning for Self-Improving Agent with Skill Library Paper • 2512.17102 • Published Dec 18, 2025 • 35
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 119
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 8 days ago • 50
NeMo Gym Collection Collection of RL verifiable data for NeMo Gym • 13 items • Updated 8 days ago • 38
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper • 2512.07843 • Published Nov 24, 2025 • 22