Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

haifasyn
/

rlhf-ppo

Generated from Trainer

Model card Files Files and versions

Instructions to use haifasyn/rlhf-ppo with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use haifasyn/rlhf-ppo with Transformers:

# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("haifasyn/rlhf-ppo", dtype="auto")

Notebooks
Google Colab
Kaggle

15.8 MB

Ctrl+K

Ctrl+K

1 contributor

History: 2 commits

haifasyn's picture

End of training

83c4cb7 verified 5 months ago

.gitattributes

1.57 kB
End of training 5 months ago
README.md

1.26 kB
End of training 5 months ago
adapter_config.json

979 Bytes
End of training 5 months ago
adapter_model.safetensors

4.34 MB
xet

End of training 5 months ago
chat_template.jinja

2.51 kB
End of training 5 months ago
tokenizer.json

11.4 MB
xet

End of training 5 months ago
tokenizer_config.json

662 Bytes
End of training 5 months ago
training_args.bin

5.46 kB
xet

End of training 5 months ago