Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

shahidul034
/
readCtrl_lambda

Safetensors
Model card Files Files and versions
xet
Community
readCtrl_lambda / code /RL_model /verl /verl_train /examples /gspo_trainer
38 kB
Ctrl+K
Ctrl+K
  • 3 contributors
History: 1 commit
mshahidul
Initial commit of readCtrl code without large models
030876e 9 days ago
  • run_qwen30b_gspo.sh
    7.63 kB
    Initial commit of readCtrl code without large models 9 days ago
  • run_qwen3_32b_gspo_npu.sh
    7.1 kB
    Initial commit of readCtrl code without large models 9 days ago
  • test_gspo_3b_math.sh
    7.92 kB
    Initial commit of readCtrl code without large models 9 days ago
  • test_gspo_3b_math_slurm.sh
    8.03 kB
    Initial commit of readCtrl code without large models 9 days ago
  • test_gspo_qwen30b_a3b_ep.sh
    7.34 kB
    Initial commit of readCtrl code without large models 9 days ago