davidanugraha/DeepSeek-R1-Distill-Qwen-7B-Overthinking-SFT Text Generation • 8B • Updated 30 days ago • 1
davidanugraha/DeepSeek-R1-Distill-Qwen-7B-Overthinking-SFT Text Generation • 8B • Updated 30 days ago • 1
davidanugraha/DeepSeek-R1-Distill-Qwen-1.5B-Overthinking-SFT Text Generation • 2B • Updated 30 days ago
davidanugraha/DeepSeek-R1-Distill-Qwen-1.5B-Overthinking-SFT Text Generation • 2B • Updated 30 days ago