Search-R1 Collection Preliminary checkpoints with outcome-only RL. • 15 items • Updated Aug 12, 2025 • 17
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B Text Generation • 15B • Updated Feb 24, 2025 • 762k • • 614
unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF Text Generation • 31B • Updated Jan 30 • 163k • 534
Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation • 480B • Updated Aug 21, 2025 • 75.6k • • 1.31k
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF Text Generation • 480B • Updated Jul 31, 2025 • 3.56k • 172