Albert Villanova del Moral
AI & ML interests
ML Engineer @ Hugging Face: Agents (Science)
Recent Activity
posted an update 1 day ago
🚀 TRL v0.29.0 introduces trl-training: an agent-native training skill.
This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)
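
As a rough illustration of what driving these workflows from an agent could look like, here is a minimal Python sketch that assembles a TRL CLI invocation as an argument list. The helper `build_trl_command` is hypothetical and is not part of TRL or the trl-training skill. The subcommands (`sft`, `dpo`, `grpo`) and flags (`--model_name_or_path`, `--dataset_name`, `--output_dir`) are assumed from the general TRL CLI documentation and should be checked against v0.29.0. Some methods may need extra arguments that are not shown here.

```python
# Hypothetical helper: builds an argv list for the TRL CLI without running it.
SUPPORTED_METHODS = {"sft", "dpo", "grpo"}  # the workflows named in the post


def build_trl_command(method: str, model: str, dataset: str, output_dir: str) -> list[str]:
    """Return a command such as ['trl', 'sft', '--model_name_or_path', ...].

    Flag names are assumptions taken from the TRL CLI docs, not from the
    v0.29.0 release notes.
    """
    if method not in SUPPORTED_METHODS:
        raise ValueError(f"unsupported method: {method!r}")
    return [
        "trl", method,
        "--model_name_or_path", model,
        "--dataset_name", dataset,
        "--output_dir", output_dir,
    ]


if __name__ == "__main__":
    # Placeholder model and dataset identifiers, for illustration only.
    cmd = build_trl_command("sft", "my-org/my-model", "my-org/my-dataset", "out/sft")
    print(" ".join(cmd))
```

An agent could hand the resulting list to `subprocess.run` once the arguments have been validated; keeping command construction separate from execution makes the call easy to inspect and log first.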
We’re excited to see what the community builds on top of this.
If you’re working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! 🤗
The future of ML tooling is agent-native.
🔗 https://github.com/huggingface/trl/releases/tag/v0.29.0
new activity 4 days ago
code-search-net/code_search_net: Convert dataset to Parquet
liked a Space 12 days ago
lm-provers/qed-nano-blogpost