Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
goyalayus
/
arithmetic-2digit-rl_2digit_06b
like
0
Transformers
Safetensors
Generated from Trainer
unsloth
grpo
trl
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
arithmetic-2digit-rl_2digit_06b
87.4 MB
Ctrl+K
Ctrl+K
1 contributor
History:
7 commits
goyalayus
Training in progress, step 150, checkpoint
14b1ade
verified
3 days ago
last-checkpoint
Training in progress, step 150, checkpoint
3 days ago
.gitattributes
Safe
1.64 kB
Training in progress, step 50, checkpoint
3 days ago
README.md
2.32 kB
Training in progress, step 50
3 days ago
adapter_config.json
1.13 kB
Training in progress, step 50
3 days ago
adapter_model.safetensors
22 MB
xet
Training in progress, step 150
3 days ago
added_tokens.json
Safe
734 Bytes
Training in progress, step 50
3 days ago
chat_template.jinja
Safe
4.91 kB
Training in progress, step 50
3 days ago
merges.txt
Safe
1.67 MB
Training in progress, step 50
3 days ago
special_tokens_map.json
Safe
499 Bytes
Training in progress, step 50
3 days ago
tokenizer.json
Safe
11.4 MB
xet
Training in progress, step 50
3 days ago
tokenizer_config.json
5.61 kB
Training in progress, step 50
3 days ago
training_args.bin
7.76 kB
xet
Training in progress, step 50
3 days ago
vocab.json
Safe
2.78 MB
Training in progress, step 50
3 days ago