nvidia/gpt-oss-120b-Eagle3-long-context Text Generation • 0.2B • Updated 16 days ago • 4.35k • 57
QuantStack/Qwen-Image-Layered-GGUF Image-Text-to-Image • 20B • Updated Dec 23, 2025 • 1.86k • 56
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 119
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 7 days ago • 51
view article Article Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models Dec 15, 2025 • 107
Motif-Technologies/Motif-2-12.7B-Reasoning Text Generation • 13B • Updated Dec 12, 2025 • 768 • 41
ServiceNow-AI/Apriel-1.6-15b-Thinker Image-Text-to-Text • 15B • Updated Dec 22, 2025 • 3.57k • • 277
unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF Text Generation • 80B • Updated 28 days ago • 69.3k • 162