Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
CMU-POPE
/
HARD-ALL-gemini_random-4-guide_no-guide-backup
like
0
Follow
Privileged On-Policy Exploration
2
Safetensors
qwen3
Model card
Files
Files and versions
xet
Community
main
HARD-ALL-gemini_random-4-guide_no-guide-backup
Commit History
Training in progress, step 160
c9b4b5e
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 150
40b1f03
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 140
d7c34fa
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 130
788ac11
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 120
8e58454
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 110
f94d8cd
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 100
aad251f
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 90
e699629
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 80
a56e1ed
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 70
083b2da
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 60
f0418c9
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 50
1f7e865
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 40
d8444da
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 30
2927047
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 20
a1740ad
verified
CohenQu
commited on
Sep 22, 2025
Training in progress, step 10
006b3b4
verified
CohenQu
commited on
Sep 22, 2025
Add tokenizer from base model
77787ea
verified
CohenQu
commited on
Sep 22, 2025
initial commit
ff6438f
verified
CohenQu
commited on
Sep 22, 2025