efficient-reasoning - a Zanette-Labs Collection

Zanette-Labs 's Collections

efficient-reasoning

efficient-reasoning

updated Apr 13, 2025

Checkpoints for models trained in https://arxiv.org/abs/2502.04463

daman1209arora/alpha_0.4_DeepSeek-R1-Distill-Qwen-7B

Text Generation • 8B • Updated Apr 13, 2025 • 3
daman1209arora/alpha_0.05_DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Apr 13, 2025 • 5
daman1209arora/alpha_0.05_DeepSeek-R1-Distill-Qwen-7B

Text Generation • 8B • Updated Apr 13, 2025 • 3
daman1209arora/alpha_0.2_DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Apr 13, 2025 • 3
daman1209arora/alpha_0.1_DeepSeek-R1-Distill-Qwen-7B

Text Generation • 8B • Updated Apr 13, 2025 • 6
daman1209arora/alpha_0.1_DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Apr 13, 2025 • 22 •
daman1209arora/alpha_0.2_DeepSeek-R1-Distill-Qwen-7B

Text Generation • 8B • Updated May 13 • 44
daman1209arora/alpha_0.4_DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Apr 13, 2025 • 12 •
daman1209arora/alpha_0_DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Apr 13, 2025 • 16 •
daman1209arora/alpha_0_DeepSeek-R1-Distill-Qwen-7B

Text Generation • 8B • Updated Apr 13, 2025 • 3