view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 307
Running on CPU Upgrade Featured 3.08k The Smol Training Playbook 📚 3.08k The secrets to building world-class LLMs
ericzhang0328/loopllama3.2-1b-deepspeed-0904-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025 • 3
ericzhang0328/llama3.2-1b-cpt-deepspeed-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025 • 1
ericzhang0328/loopllama3.2-1b-deepspeed-0904-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025 • 3
ericzhang0328/llama3.2-1b-cpt-deepspeed-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025 • 1