Article • Preference Tuning LLMs with Direct Preference Optimization Methods • Published Jan 18, 2024 • 77
Article • Transformers v5: Simple model definitions powering the AI ecosystem • Published Dec 1, 2025 • 288
Article • Nemotron 3 Nano: A New Standard for Efficient, Open, and Intelligent Agentic Models • Published Dec 15, 2025 • 106
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 24 days ago • 212
WPO: Enhancing RLHF with Weighted Preference Optimization Paper • 2406.11827 • Published Jun 17, 2024 • 17
Article • Tokenization in Transformers v5: Simpler, Clearer, and More Modular • Published Dec 18, 2025 • 119
Self-Play Preference Optimization for Language Model Alignment Paper • 2405.00675 • Published May 1, 2024 • 28
Noise Contrastive Alignment of Language Models with Explicit Rewards Paper • 2402.05369 • Published Feb 8, 2024 • 2
Towards Efficient and Exact Optimization of Language Model Alignment Paper • 2402.00856 • Published Feb 1, 2024 • 1
A General Theoretical Paradigm to Understand Learning from Human Preferences Paper • 2310.12036 • Published Oct 18, 2023 • 19