🏗️ Building on HF

Sergio Paniego PRO

sergiopaniego

huggingface

·

https://sergiopaniego.github.io/

AI & ML interests

None yet

Recent Activity

updated a dataset about 9 hours ago

agents-course/final-certificates

updated a dataset about 9 hours ago

agents-course/course-certificates-of-excellence

updated a dataset 4 days ago

huggingface-projects/Deep-RL-Course-Certification

View all activity

Organizations

upvoted an article 4 days ago

Article

Run a vLLM Server on HF Jobs in One Command

qgallouedec

•

5 days ago

• 10

upvoted a collection 5 days ago

OpenThinker-Agent2

OpenThinker-Agent2: agentic SFT/RL datasets and 8B/32B models (cold-start SFT, RL, and the OpenThinkerAgent-32B release). • 11 items • Updated 19 days ago • 8

upvoted an article 6 days ago

Article

Building Moon Bot: A Slack-Native Coding Agent Backed by HuggingFace Buckets

huggingface

•

6 days ago

• 42

upvoted a paper 6 days ago

ECHO: Terminal Agents Learn World Models for Free

Paper • 2605.24517 • Published May 23 • 8

upvoted an article 13 days ago

Article

GLM-5.2: Built for Long-Horizon Tasks

zai-org

•

13 days ago

• 111

upvoted 2 articles 15 days ago

Article

I fine-tuned a model for free from one prompt, with TRL and the Google Colab CLI

sergiopaniego

•

15 days ago

• 4

Article

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

sergiopaniego, ariG23498

•

May 25

• 124

upvoted an article 22 days ago

Article

The Open Source Community is backing OpenEnv for Agentic RL

+17

burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, thegovind, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego, banghua

•

23 days ago

• 101

upvoted an article 28 days ago

Article

Agentic RL: Token-In, Token-Out Done Right

huggingface

•

May 29

• 15

upvoted an article 29 days ago

Article

Relaunching PapersWithCode with new features

nielsr

•

May 24

• 12

upvoted an article about 1 month ago

Article

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

+3

ariG23498, sayakpaul, sergiopaniego, ror, pcuenq

•

May 29

• 130

upvoted a collection about 1 month ago

Repo2RLEnv — Verifiable RL Environments

Verifiable RL environments built from real GitHub repos. One dataset per pipeline. Source: https://github.com/huggingface/Repo2RLEnv • 5 items • Updated 12 days ago • 1

upvoted an article about 1 month ago

Article

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

+6

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego

•

May 27

• 42

upvoted 2 collections about 1 month ago

RFDetr

RF-DETR checkpoints converted to be used with 🤗 Transformers • 15 items • Updated May 27 • 17

🧬 Carbon

Carbon 500M, 3B, 8B genomic models and GGUF variants for llama.cpp • 7 items • Updated 28 days ago • 43

upvoted an article about 1 month ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 356

upvoted 2 articles about 2 months ago

Article

Unlocking asynchronicity in continuous batching

+1

ror, pcuenq, ariG23498

•

May 14

• 61

Article

Running AI agents to automate outreach at scale

nielsr

•

Apr 27

• 15

upvoted 2 articles 2 months ago

Article

Pallas for people who know JAX but not kernels yet

ariG23498

•

Apr 29

• 21

Article

The PR you would have opened yourself

pcuenq, awni

•

Apr 16

• 72