Albert Villanova del Moral

albertvillanova

huggingface

·

https://albertvillanova.github.io/

AI & ML interests

ML Engineer @ Hugging Face: Agents (Science)

Recent Activity

reacted to sergiopaniego's post with ❤️ 9 days ago

Frontier models use distillation as a step of their post-training pipelines. In 2026 it has three jobs: compress a big model into a small one, merge RL experts into a single model, and let a model teach itself. I wrote up which frontier models use each one and how: https://huggingface.co/blog/sergiopaniego/distillation-2026 It pairs with Class 2 of the Training an Agent series Ben and I are doing, where we teach these techniques hands-on with TRL!

reacted to sergiopaniego's post with 🔥 9 days ago

Frontier models use distillation as a step of their post-training pipelines. In 2026 it has three jobs: compress a big model into a small one, merge RL experts into a single model, and let a model teach itself. I wrote up which frontier models use each one and how: https://huggingface.co/blog/sergiopaniego/distillation-2026 It pairs with Class 2 of the Training an Agent series Ben and I are doing, where we teach these techniques hands-on with TRL!

reacted to sergiopaniego's post with 👍 9 days ago

Frontier models use distillation as a step of their post-training pipelines. In 2026 it has three jobs: compress a big model into a small one, merge RL experts into a single model, and let a model teach itself. I wrote up which frontier models use each one and how: https://huggingface.co/blog/sergiopaniego/distillation-2026 It pairs with Class 2 of the Training an Agent series Ben and I are doing, where we teach these techniques hands-on with TRL!

View all activity

Organizations

published an article about 2 months ago

Article

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

+6

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego

•

May 27

• 43

published an article 4 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 173

published an article 9 months ago

Article

TIL: How a Harmless Refactor Exposed a Hidden CUDA Bug in Vision-Language Models

albertvillanova

•

Oct 22, 2025

published an article about 1 year ago

Article

TinyAgents: A Minimal Experiment with Code Agents and MCP Tools

albertvillanova

•

May 16, 2025

• 30

published an article over 1 year ago

Article

Open-source DeepResearch – Freeing our search agents

+3

m-ric, albertvillanova, merve, thomwolf, clefourrier

•

Feb 4, 2025

• 1.32k

published an article over 1 year ago

Article

We now support VLMs in smolagents!

+1

m-ric, merve, albertvillanova

•

Jan 24, 2025

• 114

published an article over 1 year ago

Article

CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard

+2

alozowski, SaylorTwift, albertvillanova, clefourrier

•

Jan 9, 2025

• 20

published an article over 1 year ago

Article

CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard

+2

alozowski, SaylorTwift, albertvillanova, clefourrier

•

Jan 9, 2025

• 20