LeTue09/arithmetic-grpo
arxiv: 14 papers
Branch: main
Path: arithmetic-grpo/verl/workers/config (85.1 kB)
1 contributor
History: 1 commit
Latest commit: LeTue09, "initial clean commit" (1faccd4), 28 days ago
File              Size     Last commit           Updated
__init__.py       1.09 kB  initial clean commit  28 days ago
actor.py          16.3 kB  initial clean commit  28 days ago
critic.py         11.7 kB  initial clean commit  28 days ago
engine.py         18.9 kB  initial clean commit  28 days ago
megatron_peft.py  5 kB     initial clean commit  28 days ago
model.py          8.14 kB  initial clean commit  28 days ago
optimizer.py      8.3 kB   initial clean commit  28 days ago
reward.py         3.84 kB  initial clean commit  28 days ago
rollout.py        11.9 kB  initial clean commit  28 days ago