Running 3.77k The Ultra-Scale Playbook 🌌 3.77k The ultimate guide to training LLM on large GPU Clusters
gradientai/Llama-3-8B-Instruct-262k Text Generation • 8B • Updated Oct 28, 2024 • 1.68k • 261