Post
4
TRL now includes agent training support for GRPOβΌοΈ
Train π΅οΈ agents with π§ tools, enabling interaction with external functions and APIs.
And of course, a new notebook and scripts to get you up to speed
π notebook tutorial: https://github.com/huggingface/trl/blob/main/examples/notebooks/grpo_agent.ipynb
π script examples: https://github.com/huggingface/trl/blob/main/examples/scripts/grpo_agent.py
π¦ TRL v0.26.0 release: https://github.com/huggingface/trl/releases/tag/v0.26.0
Train π΅οΈ agents with π§ tools, enabling interaction with external functions and APIs.
And of course, a new notebook and scripts to get you up to speed
π notebook tutorial: https://github.com/huggingface/trl/blob/main/examples/notebooks/grpo_agent.ipynb
π script examples: https://github.com/huggingface/trl/blob/main/examples/scripts/grpo_agent.py
π¦ TRL v0.26.0 release: https://github.com/huggingface/trl/releases/tag/v0.26.0