🔄 In a Training Loop

Lewis Tunstall PRO

lewtun

huggingface

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

upvoted an article 1 day ago

Security incident disclosure — July 2026

liked a Space 1 day ago

ICML-2026-agent-repro/challenge

liked a Space 6 days ago

joelniklaus/harness-optimization

View all activity

Organizations

upvoted an article 1 day ago

Article

Security incident disclosure — July 2026

system

•

6 days ago

• 328

upvoted 2 articles 8 days ago

Article

Native-speed vLLM transformers modeling backend

hmellor, lysandre

•

14 days ago

• 57

Article

J-Space: Yet Another LLM Mind Reader?

dlouapre

•

9 days ago

• 31

upvoted a paper 20 days ago

AsyncOPD: How Stale Can On-Policy Distillation Be?

Paper • 2606.24143 • Published 29 days ago • 30

upvoted a changelog 26 days ago

Hugging Face Changelog

Share your feedback with us

26 days ago

• 134

upvoted an article 28 days ago

Article

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

sergiopaniego, ariG23498

•

May 25

• 134

upvoted 2 articles about 1 month ago

Article

GLM-5.2: Built for Long-Horizon Tasks

zai-org

•

Jun 17

• 134

Article

PhysicsIntern: from an Autonomous Benchmark-runner to a Research Sidekick

dlouapre

•

Jun 11

• 7

upvoted 2 articles about 2 months ago

Article

Reachy Mini goes fully local

A-Mahla, andito

•

May 27

• 69

Article

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

+6

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego

•

May 27

• 43

upvoted a collection 2 months ago

🧬 Carbon

Carbon 500M, 3B, 8B genomic models and GGUF variants for llama.cpp • 7 items • Updated Jun 2 • 44

upvoted an article 2 months ago

Article

Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law

mishig

•

May 11

• 24

upvoted an article 3 months ago

Article

Running AI agents to automate outreach at scale

nielsr

•

Apr 27

• 15

upvoted a paper 3 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 67

upvoted an article 3 months ago

Article

ML Intern Takes Our Post-Training Internship Test

cmpatino

•

Apr 23

• 31

upvoted a paper 3 months ago

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published Apr 1 • 56

upvoted a changelog 3 months ago

Hugging Face Changelog

Agent Traces on the Hub

Apr 7

• 152

upvoted an article 3 months ago

Article

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

nielsr

•

Apr 7

• 62

upvoted a collection 4 months ago

Trinity-Large-Thinking

5 items • Updated Apr 10 • 32

upvoted an article 4 months ago

Article

How I contributed a new model to the Transformers library using Codex

nielsr

•

Mar 30

• 52