Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

MilaWang
/
grpo-fullparam-sciknoweval-physics

Safetensors
Model card Files Files and versions
xet
Community
grpo-fullparam-sciknoweval-physics
Ctrl+K
Ctrl+K
  • 1 contributor
History: 122 commits
MilaWang's picture
MilaWang
Upload rollout_generations/95.jsonl
a26619b verified 7 days ago
  • best_ckpt_step_10
    Upload best_ckpt_step_10/policy 7 days ago
  • best_ckpt_step_20
    Upload best_ckpt_step_20/policy 7 days ago
  • best_ckpt_step_30
    Upload best_ckpt_step_30/policy 7 days ago
  • best_ckpt_step_40
    Upload best_ckpt_step_40/policy 7 days ago
  • best_ckpt_step_50
    Upload best_ckpt_step_50/policy 7 days ago
  • best_ckpt_step_70
    Upload best_ckpt_step_70/policy 7 days ago
  • best_ckpt_step_80
    Upload best_ckpt_step_80/policy 7 days ago
  • rollout_generations
    Upload rollout_generations/95.jsonl 7 days ago
  • val_generations
    Upload val_generations/90.jsonl 7 days ago
  • .gitattributes
    2.13 kB
    Upload val_generations/90.jsonl 7 days ago