·
AI & ML interests
None yet
Organizations
None yet
leonMW/DeepSeek-R1-Distill-Qwen-14B-GSPO-Basic-Easy
Text Generation
• 15B • Updated • 1
leonMW/Qwen3-4B-Thinking-2507-GSPO-Basic-Easy
Text Generation
• 4B • Updated leonMW/Qwen3-4B-Thinking-2507-GSPO-Easy
Text Generation
• 4B • Updated • 8
leonMW/Qwen3-4B-Thinking-2507-GSPO-Basic
Text Generation
• 4B • Updated • 7
leonMW/DeepSeek-R1-Distill-Qwen-14B-GSPO-Easy
Text Generation
• 15B • Updated leonMW/DeepSeek-R1-Distill-Qwen-1.5B-GSPO-Easy
Text Generation
• 2B • Updated leonMW/DeepSeek-R1-Distill-Qwen-7B-GSPO-Basic-Easy
Text Generation
• 8B • Updated leonMW/Qwen3-4B-Thinking-2507-GSPO-Medium
Updated
leonMW/Qwen3-4B-Thinking-2507-GSPO-Medium_Cutoff
Updated
leonMW/DeepSeek-R1-Distill-Qwen-14B-GSPO-Basic
Text Generation
• 15B • Updated • 1
leonMW/DeepSeek-R1-Distill-Qwen-7B-GSPO-Basic
Text Generation
• 8B • Updated • 1
• 1
leonMW/DeepSeek-R1-Distill-Qwen-7B-GSPO-Easy
Text Generation
• 8B • Updated leonMW/DeepSeek-R1-Distill-Qwen-1.5B-GSPO-Basic-Easy
Text Generation
• 2B • Updated leonMW/DeepSeek-R1-Distill-Qwen-1.5B-GSPO-Basic
Text Generation
• 2B • Updated • 53
leonMW/DeepSeek-R1-Distill-Qwen-1.5B-long-context-Staged-4-10
Text Generation
• 2B • Updated • 1
leonMW/DeepSeek-R1-Distill-Qwen-1.5B-long-context-Staged-1-6
Text Generation
• 2B • Updated • 2
leonMW/DeepSeek-R1-Distill-Qwen-1.5B-long-context-Staged-4
Text Generation
• 2B • Updated • 2
leonMW/Qwen3-4B-Thinking-2507-Staged-2
Text Generation
• 196k • Updated • 11
leonMW/DeepSeek-R1-Distill-Qwen-1.5B-long-context-Staged-3
Text Generation
• 2B • Updated • 1
leonMW/DeepSeek-R1-Distill-Qwen-1.5B-long-context-Staged-2
Text Generation
• 2B • Updated • 3
leonMW/DeepSeek-R1-Distill-Qwen-1.5B-long-context-Staged-1
Text Generation
• 2B • Updated • 3
leonMW/Qwen3-4B-Thinking-2507-Staged-1
Text Generation
• 196k • Updated • 6
leonMW/Qwen3-4B-Thinking-2507-Staged-4
Text Generation
• 196k • Updated • 11
leonMW/Qwen3-4B-Thinking-2507-Staged-3
Text Generation
• 196k • Updated • 5
leonMW/Qwen3-8B-GSPO-Easy
Text Generation
• 308k • Updated • 22
leonMW/Qwen3-14B-GSPO-Easy
Updated
leonMW/Qwen3-4B-Thinking-2507-GSPO-Easy-Test
Updated
leonMW/Unsloth-Qwen3-4B-Base-GSPO-Easy
Updated
leonMW/DeepSeek-R1-Distill-Qwen-14B-GRPO-Easy
Updated
leonMW/unsloth-gpt-oss-20B-LORA-GSPO-Basic
Updated