canada-quant/DeepSeek-V4-Flash-W4A16-FP8 Text Generation • 44B • Updated about 1 month ago • 11.2k • 16
Replacing thinking with tool usage enables reasoning in small language models Paper • 2507.05065 • Published Jul 7, 2025 • 17