Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
arvindcr4
/
llama-3.2-1b-arithmetic-rl-lora
like
0
Reinforcement Learning
PEFT
Safetensors
tinker
math
lora
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Use this model
main
llama-3.2-1b-arithmetic-rl-lora
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
arvindcr4
Upload README.md with huggingface_hub
867ad35
verified
21 days ago
.gitattributes
Safe
1.52 kB
initial commit
21 days ago
README.md
1.22 kB
Upload README.md with huggingface_hub
21 days ago
adapter_config.json
757 Bytes
Add Tinker RL-trained arithmetic LoRA adapter (Llama-3.2-1B)
21 days ago
adapter_model.safetensors
107 MB
xet
Add Tinker RL-trained arithmetic LoRA adapter (Llama-3.2-1B)
21 days ago
checkpoint_complete
Safe
0 Bytes
Add Tinker RL-trained arithmetic LoRA adapter (Llama-3.2-1B)
21 days ago