Nemotron-Labs-Diffusion Collection A Tri-Mode Language Model Family Unifying Autoregressive, Diffusion, and Self-Speculation Decoding • 7 items • Updated 15 days ago • 50
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • 33B • Updated Feb 24, 2025 • 789k • • 1.57k
nm-testing/DeepSeek-R1-Distill-Qwen-32B-NVFP4 Text Generation • 19B • Updated Nov 21, 2025 • 1.08k • 3