Running 3.84k The Ultra-Scale Playbook 🌌 3.84k The ultimate guide to training LLM on large GPU Clusters
Blasserman/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct-Q4_K_M-GGUF 8B • Updated Apr 27, 2025 • 11 • 1