The Ultra-Scale Playbook 🌌 • Running • 3.67k • The ultimate guide to training LLM on large GPU Clusters
deepset/roberta-base-squad2 Question Answering • 0.1B • Updated Sep 24, 2024 • 723k • 935
unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF Text Generation • 8B • Updated May 10, 2025 • 25.6k • 293