Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

shahidul034
/
readctrl

Safetensors
Model card Files Files and versions
xet
Community
readctrl / code /RL_model /verl /Search-R1 /misc /scripts /nq_hotpotqa /v0.2
  • 1 contributor
History: 1 commit
shahidul034's picture
shahidul034
Add files using upload-large-folder tool
034cb04 verified 29 days ago
  • train_grpo.sh
    3.55 kB
    Add files using upload-large-folder tool 29 days ago
  • train_ppo.sh
    3.9 kB
    Add files using upload-large-folder tool 29 days ago