Sergio Paniego PRO
Posts 99
+ continuous batching makes GRPO and RLOO 1.25x faster at -16 GB
+ proper MoE post-training across GRPO/RLOO/AsyncGRPO
+ new GMPO trainer
+ AsyncGRPO weight sync + padding-free
+ more
https://github.com/huggingface/trl/releases/tag/v1.7.0
wrote a short article about the continuous batching feature for GRPO
https://huggingface.co/blog/sergiopaniego/cb-trl-grpo
- Runtime error • RL
CARLA Environment Server
Control a CARLA driving simulation with custom actions
- Runtime error • RL
CARLA Environment Server
Control a CARLA driving simulator with custom actions
- Sleeping • Agents
Carla Grpo Trolley
Visualize your program's I/O activity in real time
- sergiopaniego/Qwen3-0.6B-carla-trolley-escape
0.8B • Updated • 6
- Running • 3.91k
The Ultra-Scale Playbook
The ultimate guide to training LLM on large GPU Clusters
- Running on CPU Upgrade • Featured • 3.22k
The Smol Training Playbook
The secrets to building world-class LLMs
- Running • 330
Evaluation Guidebook
Explore LLM benchmark scores over time
- Running • 231
FineVision: Open Data is All You Need
A new open-source dataset for training VLMs
Spaces 144
VLM Object Understanding
Explore object detection, visual grounding, keypoint Detecti
Qwen2-VL-7B
Ask questions about charts in images
SmolVLM-trl-dpo-rlaif-v
Generate text from an image and question
SmolVLM-trl-sft-ChartQA
Ask questions about charts in images
Trl Text To Sql Trackio
Show a live I/O tracking dashboard
Qwen Sql Demo
Display real-time I/O tracking dashboard