CMU-POPE/HARD-DAPO-gemini_random-4-guide_no-guide_backup
Privileged On-Policy Exploration
Safetensors
qwen3
Commit History
Training in progress, step 10 (e8c42bb, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 10 (a6d17df, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 140 (565cc26, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 130 (05e2840, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 120 (7968eab, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 110 (d2b194a, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 100 (c3ba6ed, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 90 (d059d5f, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 80 (c01dea5, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 70 (e90a433, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 60 (6acb39a, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 50 (2b4fab2, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 40 (331b7f9, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 30 (bfaf1e6, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 20 (10b3180, verified), CohenQu committed on Sep 22, 2025
Training in progress, step 10 (05707ea, verified), CohenQu committed on Sep 22, 2025
initial commit (e38af97, verified), CohenQu committed on Sep 22, 2025