Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
LeTue09
/
arithmetic-grpo
like
0
arxiv:
14 papers
Model card
Files
Files and versions
xet
Community
main
arithmetic-grpo
/
examples
/
data_preprocess
73.5 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
LeTue09
initial clean commit
1faccd4
28 days ago
aime2024_multiturn_w_tool.py
2.86 kB
initial clean commit
28 days ago
aime_dataset.py
4.48 kB
initial clean commit
28 days ago
aime_history_dataset.py
6.02 kB
initial clean commit
28 days ago
dapo_multiturn_w_tool.py
2.86 kB
initial clean commit
28 days ago
full_hh_rlhf.py
5.99 kB
initial clean commit
28 days ago
geo3k.py
3.55 kB
initial clean commit
28 days ago
geo3k_multiturn_w_tool.py
4.71 kB
initial clean commit
28 days ago
gsm8k.py
3.64 kB
initial clean commit
28 days ago
gsm8k_multiturn_sft.py
3.37 kB
initial clean commit
28 days ago
gsm8k_multiturn_w_interaction.py
4.39 kB
initial clean commit
28 days ago
gsm8k_multiturn_w_tool.py
4.96 kB
initial clean commit
28 days ago
gsm8k_tool_agent_loop.py
5.01 kB
initial clean commit
28 days ago
hellaswag.py
3.92 kB
initial clean commit
28 days ago
math_dataset.py
3.86 kB
initial clean commit
28 days ago
multiturn.py
4.67 kB
initial clean commit
28 days ago
pokemon.py
2.31 kB
initial clean commit
28 days ago
preprocess_search_r1_dataset.py
6.94 kB
initial clean commit
28 days ago