Reinforcement Learning
Transformers
Safetensors
t5
text2text-generation
trl
text-generation-inference
Instructions to use davidgaofc/PPO_base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use davidgaofc/PPO_base with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForSeq2SeqLM tokenizer = AutoTokenizer.from_pretrained("davidgaofc/PPO_base") model = AutoModelForSeq2SeqLM.from_pretrained("davidgaofc/PPO_base") - Notebooks
- Google Colab
- Kaggle
Ctrl+K