Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
fuge
luoyedenghua
Follow
AI & ML interests
None yet
Organizations
None yet
models
7
Sort: Recently updated
luoyedenghua/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
•
2B
•
Updated
Apr 9, 2025
•
2
luoyedenghua/Qwen2.5-3B-Instruct-grpo
Text Generation
•
Updated
Mar 18, 2025
•
1
luoyedenghua/Qwen2.5-3B-Open-R1-GRPO
Updated
Mar 12, 2025
luoyedenghua/Qwen2.5-1.5B-Open-R1-GRPO
Updated
Mar 11, 2025
luoyedenghua/DeepSeek-R1-Distill-Qwen-14B-GRPO
Text Generation
•
Updated
Mar 7, 2025
•
1
luoyedenghua/Qwen-2.5-7B-Simple-RL
Updated
Mar 6, 2025
luoyedenghua/Qwen2.5-1.5B-Open-R1-Distill
Updated
Feb 21, 2025
datasets
0
None public yet