·
AI & ML interests
None yet
Organizations
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_2e-5-Epoch_2
Text Generation
• 2B • Updated
• 1
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_1e-4-Epoch_2
Text Generation
• 2B • Updated
• 2
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_2e-5
Text Generation
• 2B • Updated
• 1
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_5e-5
Text Generation
• 2B • Updated
• 1
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_1e-5
Text Generation
• 2B • Updated
• 1
a-F1/Qwen2.5-1.5B-Instruct-SFT-OpenR1-LR_5e-6
Text Generation
• 2B • Updated
• 1
a-F1/Qwen-1.5B-SFT-OpenR1-LR_2e-5
Text Generation
• 2B • Updated
• 1
a-F1/Qwen-1.5B-SFT-OpenR1-LR_5e-5
Text Generation
• 2B • Updated
• 1
a-F1/Qwen-1.5B-SFT-OpenR1-LR_1e-5
Text Generation
• 2B • Updated
• 1
a-F1/Qwen-1.5B-SFT-OpenR1-LR_5e-6
Text Generation
• 2B • Updated
• 1
a-F1/Qwen-1.5B-SFT-OpenR1
2B • Updated
• 1
Text Generation
• 8B • Updated
• 1
• 1
8B • Updated
a-F1/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
• 1
a-F1/Qwen2.5-Math-1.5B-Open-R1-Distill-bi
Text Generation
• 2B • Updated
a-F1/Qwen2.5-Math-1.5B-Open-R1-Distill-mixed
Text Generation
• 2B • Updated
a-F1/Qwen2.5-Math-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
a-F1/Qwen2.5-1.5B-Open-R1-Distill-bi
Text Generation
• 2B • Updated
• 1
a-F1/Qwen2.5-1.5B-Open-R1-Distill-mixed
Text Generation
• 2B • Updated
• 1
a-F1/Qwen2.5-7B-Open-R1-Distill-mixed
Updated
a-F1/Qwen2.5-7B-Open-R1-Distill-bi
Text Generation
• 8B • Updated
• 1
Text Generation
• 7B • Updated
• 1
Text Generation
• 7B • Updated
a-F1/SimNPO_TOFU_Forget10
Text Generation
• 7B • Updated
• 1
a-F1/SimNPO_TOFU_Forget05
Text Generation
• 7B • Updated