
Guardrium/spicy-motivator-ppo

Reinforcement Learning



Libraries

How to use Guardrium/spicy-motivator-ppo with PEFT:

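# Load the base model, then attach the adapter weights from this repository.
from peft import PeftModel
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.1-8B")
# Wrap the base model with the adapter stored in adapter_model.safetensors.
model = PeftModel.from_pretrained(base_model, "Guardrium/spicy-motivator-ppo")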
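
Once the adapter is attached, text generation might look like the sketch below. This is a minimal illustration and not a documented recipe for this model: the helper name `generate_reply`, the `max_new_tokens` default, and the choice to load the tokenizer from the adapter repository (which ships `tokenizer.json` and `tokenizer_config.json`) are all assumptions. Access to the base model `meta-llama/Llama-3.1-8B` may need to be granted on the Hub before the download works.

```python
def generate_reply(prompt: str, max_new_tokens: int = 64) -> str:
    """Hypothetical helper: load base model + adapter and continue a prompt."""
    # Imports live inside the function so that defining it does not require
    # the heavy dependencies (peft, transformers, torch) to be installed.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    base_id = "meta-llama/Llama-3.1-8B"           # base model named in the snippet above
    adapter_id = "Guardrium/spicy-motivator-ppo"  # this repository

    # Assumption: the tokenizer files shipped with the adapter are the ones to use.
    tokenizer = AutoTokenizer.from_pretrained(adapter_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)
    model = PeftModel.from_pretrained(base_model, adapter_id)
    model.eval()  # inference mode: disables dropout

    # Tokenize the prompt, generate a continuation, and decode it back to text.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_reply("Give me a pep talk.")` would download both the base weights and the adapter on first use, so it needs enough memory for an 8B-parameter model.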


185 MB


1 contributor

History: 2 commits

Latest commit: fa2c63d (verified), "Upload folder using huggingface_hub", 5 months ago

File                        Size             Last commit message                  Updated
.gitattributes              1.57 kB          Upload folder using huggingface_hub  5 months ago
README.md                   1.38 kB          Upload folder using huggingface_hub  5 months ago
adapter_config.json         1.05 kB          Upload folder using huggingface_hub  5 months ago
adapter_model.safetensors   168 MB (xet)     Upload folder using huggingface_hub  5 months ago
special_tokens_map.json     335 Bytes        Upload folder using huggingface_hub  5 months ago
tokenizer.json              17.2 MB (xet)    Upload folder using huggingface_hub  5 months ago
tokenizer_config.json       50.6 kB          Upload folder using huggingface_hub  5 months ago
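
As a quick sanity check, the per-file sizes in the listing can be summed and compared with the 185 MB figure shown for the repository. The sketch below assumes decimal units (1 kB = 1000 bytes, 1 MB = 1,000,000 bytes); the helper `to_bytes` and the `LISTED_SIZES` mapping are illustrative names, and because the displayed sizes are rounded, the total is only approximate.

```python
# Sizes exactly as displayed in the file listing.
LISTED_SIZES = {
    ".gitattributes": "1.57 kB",
    "README.md": "1.38 kB",
    "adapter_config.json": "1.05 kB",
    "adapter_model.safetensors": "168 MB",
    "special_tokens_map.json": "335 Bytes",
    "tokenizer.json": "17.2 MB",
    "tokenizer_config.json": "50.6 kB",
}

# Assumption: decimal (SI) multipliers, not 1024-based ones.
UNIT_FACTORS = {"Bytes": 1, "kB": 1e3, "MB": 1e6, "GB": 1e9}


def to_bytes(text: str) -> float:
    """Convert a display string such as '17.2 MB' to a byte count."""
    value, unit = text.split()
    return float(value) * UNIT_FACTORS[unit]


total_bytes = sum(to_bytes(size) for size in LISTED_SIZES.values())
total_mb = total_bytes / 1e6
print(f"Approximate total: {total_mb:.1f} MB")
```

Under these assumptions the sum comes to roughly 185 MB, with the adapter weights and `tokenizer.json` accounting for nearly all of it.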