Running on CPU Upgrade Featured 2.97k The Smol Training Playbook š 2.97k The secrets to building world-class LLMs
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B Text Generation ⢠8B ⢠Updated May 29, 2025 ⢠132k ⢠⢠1.03k