·
AI & ML interests
None yet
Organizations
SWY666/SimPO_adjusted_Best3_Qwen
3B
•
Updated
•
4
SWY666/SimPO_adjusted_Best13_Qwen
3B
•
Updated
•
7
SWY666/SimPO_adjusted_Best3-2
Updated
SWY666/SimPO_adjusted_Best3
Updated
SWY666/Qwen-2.5-7B-Simple-RL-with-reward-model-pure-debug
Updated
SWY666/Qwen-2.5-7B-Simple-RL-with-reward-model-pure
Updated
SWY666/Qwen-2.5-7B-Simple-RL-with-reward-model
Text Generation
•
8B
•
Updated
•
7
SWY666/Qwen-2.5-7B-Simple-RL-debug
Updated
SWY666/Qwen-2.5-7B-Simple-RL
Text Generation
•
8B
•
Updated
•
7
Text Generation
•
3B
•
Updated
•
108
SWY666/Qwen2.5-1.5B-Open-R1-GRPO
Updated
SWY666/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated