Self-Taught Self-Correction for Small Language Models Paper • 2503.08681 • Published Mar 11, 2025 • 16
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24, 2025 • 643k • 1.53k
deepcogito/cogito-v1-preview-llama-3B Text Generation • 4B • Updated Apr 8, 2025 • 186 • 101
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B Text Generation • 8B • Updated May 29, 2025 • 1.07M • 1.08k
google/gemma-3-27b-it-qat-q4_0-gguf Image-Text-to-Text • 27B • Updated Apr 11, 2025 • 295 • 401