uiuc-kang-lab's Collections
RL Generalizability
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-step-100
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-dapo (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-12-6 (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-10-6 (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-8-6 (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-6-6 (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-4-6 (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-11-6 (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-9-6 (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-7-6 (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-5-6 (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-3-6 (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-1-6 (2B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-2-6 (2B)
uiuc-kang-lab/Llama3.2-3B-Instruct-math (3B)
uiuc-kang-lab/R1-Distill-Qwen-1.5B-mixed (2B)