shahidul034/readCtrl_lambda
Branch main: readCtrl_lambda/code/RL_model/verl/verl_train/docs/algo (173 kB)
3 contributors. History: 1 commit
Latest commit: mshahidul, "Initial commit of readCtrl code without large models", 030876e, 8 days ago
File                  Size      Last commit                                            Updated
baseline.md           12 kB     Initial commit of readCtrl code without large models   8 days ago
collabllm.md          6.23 kB   Initial commit of readCtrl code without large models   8 days ago
dapo.md               10.6 kB   Initial commit of readCtrl code without large models   8 days ago
entropy.md            8 kB      Initial commit of readCtrl code without large models   8 days ago
gpg.md                1.57 kB   Initial commit of readCtrl code without large models   8 days ago
grpo.md               5.76 kB   Initial commit of readCtrl code without large models   8 days ago
opo.md                2.25 kB   Initial commit of readCtrl code without large models   8 days ago
otb.md                4.72 kB   Initial commit of readCtrl code without large models   8 days ago
ppo.md                6.89 kB   Initial commit of readCtrl code without large models   8 days ago
rollout_corr.md       52.8 kB   Initial commit of readCtrl code without large models   8 days ago
rollout_corr_math.md  47.6 kB   Initial commit of readCtrl code without large models   8 days ago
spin.md               11.5 kB   Initial commit of readCtrl code without large models   8 days ago
sppo.md               3.21 kB   Initial commit of readCtrl code without large models   8 days ago