Post
1730
TRL now includes agent training support for GRPOโผ๏ธ
Train ๐ต๏ธ agents with ๐ง tools, enabling interaction with external functions and APIs.
And of course, a new notebook and scripts to get you up to speed
๐ notebook tutorial: https://github.com/huggingface/trl/blob/main/examples/notebooks/grpo_agent.ipynb
๐ script examples: https://github.com/huggingface/trl/blob/main/examples/scripts/grpo_agent.py
๐ฆ TRL v0.26.0 release: https://github.com/huggingface/trl/releases/tag/v0.26.0
Train ๐ต๏ธ agents with ๐ง tools, enabling interaction with external functions and APIs.
And of course, a new notebook and scripts to get you up to speed
๐ notebook tutorial: https://github.com/huggingface/trl/blob/main/examples/notebooks/grpo_agent.ipynb
๐ script examples: https://github.com/huggingface/trl/blob/main/examples/scripts/grpo_agent.py
๐ฆ TRL v0.26.0 release: https://github.com/huggingface/trl/releases/tag/v0.26.0