kakaocorp/kanana-2-30b-a3b-thinking-2601 Text Generation • 31B • Updated 12 days ago • 1.14k • 54
LGAI-EXAONE/K-EXAONE-236B-A23B Text Generation • 237B • Updated about 21 hours ago • 13.5k • 523
naver-hyperclovax/HyperCLOVAX-SEED-Think-32B Text Generation • 33B • Updated 21 days ago • 32.9k • 391
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation • 32B • Updated 8 days ago • 440k • 593
Running on CPU Upgrade Featured 2.93k The Smol Training Playbook 📚 2.93k The secrets to building world-class LLMs
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 24 items • Updated 4 days ago • 92
Qwen/Qwen3-VL-30B-A3B-Instruct-FP8 Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 230k • 97
deepseek-ai/DeepSeek-V3.2-Exp Text Generation • 685B • Updated Nov 18, 2025 • 63.3k • • 943