meta-llama/Llama-3.2-1B-Instruct Text Generation • 1B • Updated Oct 24, 2024 • 8.77M • • 1.5k
Running on CPU Upgrade Featured 3.22k The Smol Training Playbook 📚 3.22k The secrets to building world-class LLMs
Qwen/Qwen3-235B-A22B-Instruct-2507 Text Generation • 235B • Updated Sep 17, 2025 • 87k • • 785
huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated Text Generation • 33B • Updated Feb 16, 2025 • 2.66k • • 248
Effects of personality traits in predicting grade retention of Brazilian students Paper • 2107.05767 • Published Jul 12, 2021
Running 3.92k The Ultra-Scale Playbook 🌌 3.92k The ultimate guide to training LLM on large GPU Clusters