Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
MilaWang
/
grpo-fullparam-sciknoweval-physics
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
main
grpo-fullparam-sciknoweval-physics
Ctrl+K
Ctrl+K
1 contributor
History:
122 commits
MilaWang
Upload rollout_generations/95.jsonl
a26619b
verified
7 days ago
best_ckpt_step_10
Upload best_ckpt_step_10/policy
7 days ago
best_ckpt_step_20
Upload best_ckpt_step_20/policy
7 days ago
best_ckpt_step_30
Upload best_ckpt_step_30/policy
7 days ago
best_ckpt_step_40
Upload best_ckpt_step_40/policy
7 days ago
best_ckpt_step_50
Upload best_ckpt_step_50/policy
7 days ago
best_ckpt_step_70
Upload best_ckpt_step_70/policy
7 days ago
best_ckpt_step_80
Upload best_ckpt_step_80/policy
7 days ago
rollout_generations
Upload rollout_generations/95.jsonl
7 days ago
val_generations
Upload val_generations/90.jsonl
7 days ago
.gitattributes
Safe
2.13 kB
Upload val_generations/90.jsonl
7 days ago