🏗️ Building on HF

Sergio Paniego PRO

sergiopaniego

huggingface

https://sergiopaniego.github.io/

AI & ML interests

None yet

Recent Activity

updated a dataset about 10 hours ago

agents-course/final-certificates

updated a dataset about 10 hours ago

agents-course/course-certificates-of-excellence

updated a dataset 3 days ago

huggingface-projects/Deep-RL-Course-Certification

Buckets 24

sergiopaniego/trl-text-to-sql-trackio-bucket

sergiopaniego/trl-text-to-sql-static-f7b321-bucket

sergiopaniego/huggingface-static-343d67-bucket

sergiopaniego/qwen-sql-demo-bucket

sergiopaniego/qwen25-sql-qlora-demo-bucket

sergiopaniego/qwen25-sql-qlora-static-c26244-bucket

Posts 99

Post

TRL v1.7.0 is out‼️

+ continuous batching makes GRPO and RLOO 1.25x faster while using 16 GB less memory
+ proper MoE post-training across GRPO/RLOO/AsyncGRPO
+ new GMPO trainer
+ AsyncGRPO weight sync + padding-free
+ more

https://github.com/huggingface/trl/releases/tag/v1.7.0

I also wrote a short article about the continuous batching feature for GRPO:

https://huggingface.co/blog/sergiopaniego/cb-trl-grpo

Articles 23

Article

Continuous batching for GRPO, now in TRL

Collections 9

Spaces 144

VLM Object Understanding

Explore object detection, visual grounding, keypoint Detecti

Qwen2-VL-7B

Ask questions about charts in images

SmolVLM-trl-dpo-rlaif-v

Generate text from an image and question

SmolVLM-trl-sft-ChartQA

Ask questions about charts in images

Trl Text To Sql Trackio

Show a live I/O tracking dashboard

Qwen Sql Demo

Display real-time I/O tracking dashboard

Models 126

sergiopaniego/Qwen2.5-0.5B-Instruct-text-to-sql-qlora

Updated 14 days ago

sergiopaniego/browsergym-grpo-functiongemma-270m-it

Text Generation • 0.3B • Updated May 29 • 5 • 2

sergiopaniego/qwen3-grpo-requests

sergiopaniego/reasoning-gym-chain-sum-Qwen3-1.7B-sft

Text Generation • 2B • Updated May 4 • 6

sergiopaniego/reasoning-gym-chain-sum-Qwen3-1.7B

Text Generation • 2B • Updated Apr 28 • 41

sergiopaniego/carla-vlm-gemma-test

sergiopaniego/carla-vlm-qwen35-test

sergiopaniego/carla-vlm-gemma

sergiopaniego/carla-vlm-qwen35

sergiopaniego/nemotron-3-sft

Datasets 9

sergiopaniego/requests-pr-diff

Viewer • Updated May 19 • 1 • 50

sergiopaniego/trl-r2e-test

Viewer • Updated May 18 • 1 • 12

sergiopaniego/chain-sum-rollouts

Viewer • Updated May 4 • 50 • 19

sergiopaniego/ttt-scripted-smoke

Viewer • Updated Apr 17 • 20 • 15

sergiopaniego/sample_videos

Viewer • Updated Jun 30, 2025 • 2 • 11

sergiopaniego/difficult_prompts

Viewer • Updated Jun 20, 2025 • 38 • 26

sergiopaniego/ourworldindata_example

Viewer • Updated Dec 2, 2024 • 13 • 52 • 1

sergiopaniego/faiss_embeddings

Updated Oct 3, 2024 • 10

sergiopaniego/CarlaFollowLanePreviousV

Viewer • Updated Sep 6, 2023 • 59.6k • 16