nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-FP8 Any-to-Any • 33B • Updated about 17 hours ago • 94.8k • 44
deepseek-ai/DeepSeek-V4-Flash Text Generation • 158B • Updated about 3 hours ago • 561k • • 952
Running 3.83k The Ultra-Scale Playbook 🌌 3.83k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-V4-Pro Text Generation • 862B • Updated about 3 hours ago • 631k • • 3.6k