·
AI & ML interests
None yet
Organizations
None yet
LuyiCui/slow_fast_reason-sft-s1k-1.1_full
Text Generation
• 8B • Updated • 2
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-SAPO
2B • Updated • 1
LuyiCui/sft-amc_aime-R1-Distill-Qwen-1.5B
2B • Updated • 1
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
LuyiCui/Qwen2.5-1.5B-Instruct-CEPO
Text Generation
• 2B • Updated • 3
LuyiCui/Qwen2.5-Math-1.5B-GRPO
Updated
LuyiCui/Qwen2.5-1.5B-GRPO
Updated
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-123
Text Generation
• 2B • Updated • 8
LuyiCui/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated • 3
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-3
Updated
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-2-2
Text Generation
• 2B • Updated • 8
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-2
Text Generation
• 2B • Updated • 5
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-1
Text Generation
• 2B • Updated • 6
Feature Extraction
• 3B • Updated • 1
Feature Extraction
• 2B • Updated • 1
Text Generation
• 0.5B • Updated • 3
LuyiCui/Qwen2.5-1.5B-Open-R1-Distill
Updated
Feature Extraction
• 2B • Updated • 1
Feature Extraction
• 0.5B • Updated • 1
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO
Text Generation
• 2B • Updated • 7
• 1