Running 3.83k The Ultra-Scale Playbook 🌌 3.83k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct Text Generation • 16B • Updated Jul 3, 2024 • 1.1M • • 594
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • 71B • Updated Apr 13, 2025 • 11.7k • • 2.07k
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.64M • • 3k