Albert Villanova del Moral

albertvillanova

huggingface

https://albertvillanova.github.io/

AI & ML interests

ML Engineer @ Hugging Face: Agents (Science)

Recent Activity

reacted to sergiopaniego's post with ❤️ 9 days ago

Frontier models use distillation as a step of their post-training pipelines. In 2026 it has three jobs: compress a big model into a small one, merge RL experts into a single model, and let a model teach itself. I wrote up which frontier models use each one and how: https://huggingface.co/blog/sergiopaniego/distillation-2026 It pairs with Class 2 of the Training an Agent series Ben and I are doing, where we teach these techniques hands-on with TRL!

reacted to sergiopaniego's post with 🔥 9 days ago

reacted to sergiopaniego's post with 👍 9 days ago

Organizations

New activity in trl-internal-testing/tiny-DeepseekV3ForCausalLM-0528 16 days ago

Upload DeepseekV3ForCausalLM

#4 opened 16 days ago by

albertvillanova

New activity in trl-internal-testing/tiny-DeepseekV3ForCausalLM 16 days ago

Upload DeepseekV3ForCausalLM

#3 opened 16 days ago by

albertvillanova

New activity in trl-internal-testing/tiny-DeepseekV3ForCausalLM-0528 16 days ago

Upload DeepseekV3ForCausalLM

#2 opened 2 months ago by

Upload DeepseekV3ForCausalLM

#3 opened 2 months ago by

Upload DeepseekV3ForCausalLM

#1 opened 2 months ago by

New activity in trl-internal-testing/tiny-DeepseekV3ForCausalLM 16 days ago

Upload tiny DeepseekV3ForCausalLM

#2 opened 3 months ago by

New activity in trl-internal-testing/tiny-Qwen2_5_VLForConditionalGeneration about 2 months ago

Upload Qwen2_5_VLForConditionalGeneration

#16 opened about 2 months ago by

albertvillanova

New activity in trl-internal-testing/tiny-Qwen2VLForConditionalGeneration about 2 months ago

Upload Qwen2VLForConditionalGeneration

#6 opened 2 months ago by

albertvillanova

New activity in trl-internal-testing/tiny-Qwen2_5_VLForConditionalGeneration 2 months ago

Upload Qwen2_5_VLForConditionalGeneration

#13 opened 2 months ago by

albertvillanova

New activity in trl-internal-testing/tiny-Qwen3VLForConditionalGeneration 2 months ago

Upload Qwen3VLForConditionalGeneration

#6 opened 2 months ago by

albertvillanova

Upload Qwen3VLForConditionalGeneration

#5 opened 2 months ago by

albertvillanova

New activity in trl-internal-testing/tiny-Qwen2_5_VLForConditionalGeneration 2 months ago

Upload Qwen2_5_VLForConditionalGeneration

#12 opened 2 months ago by

albertvillanova

New activity in trl-internal-testing/tiny-Qwen2VLForConditionalGeneration 2 months ago

Upload Qwen2VLForConditionalGeneration

#5 opened 2 months ago by

albertvillanova

New activity in trl-internal-testing/tiny-Gemma4ForConditionalGeneration 2 months ago

Upload Gemma4ForConditionalGeneration

#6 opened 2 months ago by

albertvillanova

Upload Gemma4ForConditionalGeneration

#5 opened 2 months ago by

albertvillanova

Upload Gemma4ForConditionalGeneration

#4 opened 2 months ago by

albertvillanova

Upload Gemma4ForConditionalGeneration

#3 opened 2 months ago by

albertvillanova

New activity in trl-internal-testing/tiny-Gemma3ForConditionalGeneration 3 months ago

Upload Gemma3ForConditionalGeneration

#10 opened 3 months ago by

albertvillanova

Upload Gemma3ForConditionalGeneration

#9 opened 3 months ago by

albertvillanova

Upload Gemma3ForConditionalGeneration

#8 opened 3 months ago by

albertvillanova