Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
shahidul034
/
readCtrl_lambda
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
main
readCtrl_lambda
/
code
/
RL_model
/
verl
/
verl_train
/
tests
/
special_e2e
98.6 kB
Ctrl+K
Ctrl+K
3 contributors
History:
1 commit
mshahidul
Initial commit of readCtrl code without large models
030876e
8 days ago
envs
Initial commit of readCtrl code without large models
8 days ago
generation
Initial commit of readCtrl code without large models
8 days ago
ppo_trainer
Initial commit of readCtrl code without large models
8 days ago
sft
Initial commit of readCtrl code without large models
8 days ago
README.md
83 Bytes
Initial commit of readCtrl code without large models
8 days ago
__init__.py
Safe
600 Bytes
Initial commit of readCtrl code without large models
8 days ago
check_custom_rwd_fn.py
1.17 kB
Initial commit of readCtrl code without large models
8 days ago
check_results.py
1.75 kB
Initial commit of readCtrl code without large models
8 days ago
run_dapo.sh
3.57 kB
Initial commit of readCtrl code without large models
8 days ago
run_fully_async_policy.sh
7.86 kB
Initial commit of readCtrl code without large models
8 days ago
run_geo3k_fsdp_sgl_multiturn_w_tool.sh
2.61 kB
Initial commit of readCtrl code without large models
8 days ago
run_grpo_lora_with_merge.sh
3.59 kB
Initial commit of readCtrl code without large models
8 days ago
run_gsm8k_fsdp_sgl_multiturn_sf_tool.sh
2.52 kB
Initial commit of readCtrl code without large models
8 days ago
run_gsm8k_fsdp_sgl_multiturn_w_tool.sh
2.6 kB
Initial commit of readCtrl code without large models
8 days ago
run_one_step_off_policy.sh
7 kB
Initial commit of readCtrl code without large models
8 days ago
run_ppo_trainer_megatron.sh
12.5 kB
Initial commit of readCtrl code without large models
8 days ago
run_test.sh
359 Bytes
Initial commit of readCtrl code without large models
8 days ago
run_transferqueue.sh
7.22 kB
Initial commit of readCtrl code without large models
8 days ago