samhitha2601/llama3.2-3b-ppo

Reinforcement Learning

text-generation


Instructions to use samhitha2601/llama3.2-3b-ppo with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use samhitha2601/llama3.2-3b-ppo with Transformers:

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("samhitha2601/llama3.2-3b-ppo")
model = AutoModelForCausalLM.from_pretrained("samhitha2601/llama3.2-3b-ppo")
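
Once the tokenizer and model are loaded, generating text might look like the sketch below. It is illustrative only: the helper name `generate_text`, the prompt, and the `max_new_tokens` value are assumptions rather than anything from the model card, and it presumes the repository provides loadable weights (the file listing on this page shows only configuration and tokenizer files).

```python
MODEL_ID = "samhitha2601/llama3.2-3b-ppo"


def generate_text(prompt: str, max_new_tokens: int = 50) -> str:
    """Hypothetical helper: load the checkpoint and return a completion for `prompt`."""
    # Imported lazily so that defining this function does not require
    # torch/transformers to be installed or anything to be downloaded.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    model.eval()  # inference mode: disables dropout

    # Tokenize the prompt into PyTorch tensors.
    inputs = tokenizer(prompt, return_tensors="pt")

    # No gradients are needed for generation.
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # generate() returns the prompt tokens followed by the continuation;
    # slice off the prompt so only the newly generated text is decoded.
    prompt_length = inputs["input_ids"].shape[1]
    new_tokens = output_ids[0, prompt_length:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate_text("Reinforcement learning is")` would download the checkpoint on first use and return the continuation as a string.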


llama3.2-3b-ppo

17.3 MB

1 contributor

History: 2 commits


Latest commit: Upload checkpoint from step 467 (58a60b1, verified, 9 months ago)

File                      Size           Last commit
.gitattributes            1.57 kB        Upload checkpoint from step 467 (9 months ago)
README.md                 3.66 kB        Upload checkpoint from step 467 (9 months ago)
config.json               904 Bytes      Upload checkpoint from step 467 (9 months ago)
generation_config.json    184 Bytes      Upload checkpoint from step 467 (9 months ago)
special_tokens_map.json   325 Bytes      Upload checkpoint from step 467 (9 months ago)
tokenizer.json            17.2 MB (xet)  Upload checkpoint from step 467 (9 months ago)
tokenizer_config.json     54.6 kB        Upload checkpoint from step 467 (9 months ago)
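
As a quick sanity check, the per-file sizes listed above can be summed and compared with the 17.3 MB figure shown near the top of the listing, which is presumably the repository total. The sketch below assumes decimal units (1 kB = 1,000 bytes, 1 MB = 1,000,000 bytes); the page does not state which convention it uses, and the names `FILE_SIZES` and `to_bytes` are illustrative.

```python
# Sizes copied from the file listing; units are assumed to be decimal (SI).
UNIT_BYTES = {"Bytes": 1, "kB": 1_000, "MB": 1_000_000}

FILE_SIZES = {
    ".gitattributes": "1.57 kB",
    "README.md": "3.66 kB",
    "config.json": "904 Bytes",
    "generation_config.json": "184 Bytes",
    "special_tokens_map.json": "325 Bytes",
    "tokenizer.json": "17.2 MB",
    "tokenizer_config.json": "54.6 kB",
}


def to_bytes(size: str) -> float:
    """Convert a string such as '1.57 kB' into a byte count."""
    value, unit = size.split()
    return float(value) * UNIT_BYTES[unit]


total_bytes = sum(to_bytes(size) for size in FILE_SIZES.values())
total_mb = total_bytes / 1_000_000
print(f"Total: {total_mb:.1f} MB")
```

Under that assumption the rounded sum agrees with the displayed 17.3 MB, and tokenizer.json accounts for nearly all of it.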