Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
kangdawei
/
MMR-Adaptive-Smooth-DR_GRPO
like
0
Safetensors
qwen2
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
MMR-Adaptive-Smooth-DR_GRPO
Commit History
Training in progress, step 100
dfb59a7
verified
kangdawei
commited on
Oct 25, 2025
Training in progress, step 50
6f155c0
verified
kangdawei
commited on
Oct 25, 2025
Training in progress, step 450
e192dce
verified
kangdawei
commited on
Oct 8, 2025
Training in progress, step 400
cd01b84
verified
kangdawei
commited on
Oct 8, 2025
Training in progress, step 350
2eb11c3
verified
kangdawei
commited on
Oct 8, 2025
Training in progress, step 300
0c4c29d
verified
kangdawei
commited on
Oct 8, 2025
Training in progress, step 250
43e3642
verified
kangdawei
commited on
Oct 8, 2025
Training in progress, step 200
191eba2
verified
kangdawei
commited on
Oct 8, 2025
Training in progress, step 150
21ab48f
verified
kangdawei
commited on
Oct 8, 2025
Training in progress, step 100
83d3eca
verified
kangdawei
commited on
Oct 7, 2025
Training in progress, step 50
ca32995
verified
kangdawei
commited on
Oct 7, 2025
initial commit
d44a8e4
verified
kangdawei
commited on
Oct 7, 2025