Running 3.74k The Ultra-Scale Playbook 🌌 3.74k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct Text Generation • 16B • Updated Jul 3, 2024 • 217k • • 558