view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 9 days ago • 74
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 17 items • Updated 3 days ago • 36
view article Article Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models 11 days ago • 100
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 26 days ago • 254
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10 • 211