Albert Villanova del Moral
AI & ML interests
ML Engineer @ Hugging Face: Agents (Science)
Recent Activity
posted an
update
1 day ago
๐ TRL v0.29.0 introduces trl-training: an agent-native training skill.
This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)
Weโre excited to see what the community builds on top of this.
If youโre working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! ๐ค
The future of ML tooling is agent-native.
๐ https://github.com/huggingface/trl/releases/tag/v0.29.0 new activity
4 days ago
code-search-net/code_search_net:Convert dataset to Parquet liked
a Space 12 days ago
lm-provers/qed-nano-blogpost