Text Generation
PEFT
Safetensors
English
reinforcement-learning
rlhf
ppo
small-language-models
lora
slm
agents
Instructions to use mr3haque/SLM-RL-Agents with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use mr3haque/SLM-RL-Agents with PEFT:
Task type is invalid.
- Notebooks
- Google Colab
- Kaggle
Welcome to the community
The community tab is the place to discuss and collaborate with the HF community!