Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

arvindcr4
/

llama-3.2-1b-arithmetic-rl-lora

Reinforcement Learning

Model card Files Files and versions

Instructions to use arvindcr4/llama-3.2-1b-arithmetic-rl-lora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use arvindcr4/llama-3.2-1b-arithmetic-rl-lora with PEFT:

from peft import PeftModel
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.2-1B")
model = PeftModel.from_pretrained(base_model, "arvindcr4/llama-3.2-1b-arithmetic-rl-lora")

Notebooks
Google Colab
Kaggle

llama-3.2-1b-arithmetic-rl-lora

107 MB

Ctrl+K

Ctrl+K

1 contributor

History: 3 commits

arvindcr4's picture

Upload README.md with huggingface_hub

867ad35 verified 4 months ago

.gitattributes

1.52 kB
initial commit 4 months ago
README.md

1.22 kB
Upload README.md with huggingface_hub 4 months ago
adapter_config.json

757 Bytes
Add Tinker RL-trained arithmetic LoRA adapter (Llama-3.2-1B) 4 months ago
adapter_model.safetensors

107 MB
xet

Add Tinker RL-trained arithmetic LoRA adapter (Llama-3.2-1B) 4 months ago
checkpoint_complete

0 Bytes
Add Tinker RL-trained arithmetic LoRA adapter (Llama-3.2-1B) 4 months ago