Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kangdawei
/
MMR-Sigmoid-DAPO
like
0
Text Generation
Transformers
Safetensors
knoveleng/open-rs
qwen2
Generated from Trainer
open-r1
dapo
trl
conversational
text-generation-inference
arxiv:
2503.14476
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
MMR-Sigmoid-DAPO
Commit History
End of training
5da64fd
verified
kangdawei
commited on
Dec 26, 2025
Model save
6e75472
verified
kangdawei
commited on
Dec 26, 2025
Training in progress, step 500
189e51a
verified
kangdawei
commited on
Dec 26, 2025
Training in progress, step 450
7b26c1c
verified
kangdawei
commited on
Dec 26, 2025
Training in progress, step 400
89c1543
verified
kangdawei
commited on
Dec 26, 2025
Training in progress, step 350
0632597
verified
kangdawei
commited on
Dec 26, 2025
Training in progress, step 300
f19cf0f
verified
kangdawei
commited on
Dec 26, 2025
Training in progress, step 250
3425b52
verified
kangdawei
commited on
Dec 26, 2025
Training in progress, step 200
98fdfc9
verified
kangdawei
commited on
Dec 26, 2025
Training in progress, step 150
d4aabaf
verified
kangdawei
commited on
Dec 26, 2025
Training in progress, step 100
8f11be7
verified
kangdawei
commited on
Dec 26, 2025
Training in progress, step 50
b7b4a12
verified
kangdawei
commited on
Dec 25, 2025
Training in progress, step 320
d70b628
verified
kangdawei
commited on
Dec 21, 2025
Training in progress, step 310
5d9e514
verified
kangdawei
commited on
Dec 21, 2025
Training in progress, step 300
4c74482
verified
kangdawei
commited on
Dec 21, 2025
Training in progress, step 290
d1067cc
verified
kangdawei
commited on
Dec 21, 2025
Training in progress, step 280
05b6106
verified
kangdawei
commited on
Dec 21, 2025
Training in progress, step 270
3dd9983
verified
kangdawei
commited on
Dec 21, 2025
Training in progress, step 260
71c82f3
verified
kangdawei
commited on
Dec 21, 2025
Training in progress, step 250
3cd00b8
verified
kangdawei
commited on
Dec 21, 2025
Training in progress, step 240
a02dfcd
verified
kangdawei
commited on
Dec 21, 2025
Training in progress, step 230
2b56d3a
verified
kangdawei
commited on
Dec 21, 2025
Training in progress, step 220
e722d87
verified
kangdawei
commited on
Dec 21, 2025
Training in progress, step 210
deac8b3
verified
kangdawei
commited on
Dec 21, 2025
End of training
17b72b2
verified
kangdawei
commited on
Dec 19, 2025
Model save
465842c
verified
kangdawei
commited on
Dec 19, 2025
Training in progress, step 200
12275e8
verified
kangdawei
commited on
Dec 19, 2025
Training in progress, step 190
f83d0f1
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 180
ceef96c
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 170
110fed2
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 160
3125505
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 150
39f7256
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 140
25e5e52
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 130
32cb0cc
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 120
8fcfb20
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 110
66c90a6
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 100
430e006
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 90
d5fa436
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 80
3424513
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 70
0ae19af
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 60
2f4bb35
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 50
105e4cb
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 40
f3ad7fd
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 30
f4dee71
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 20
6311804
verified
kangdawei
commited on
Dec 18, 2025
Training in progress, step 10
d8927fc
verified
kangdawei
commited on
Dec 18, 2025
initial commit
0b422a3
verified
kangdawei
commited on
Dec 18, 2025