The Smol Training Playbook • The secrets to building world-class LLMs • Featured • 2.69k
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B • Text Generation • 2B • Updated Feb 24 • 2.59M • 1.42k