The Ultra-Scale Playbook 🌌 • 3.6k • The ultimate guide to training LLMs on large GPU clusters
Deepseek Papers Collection • 27 items • Updated about 6 hours ago • 290