Article Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL +5 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra • 2 days ago • 23
Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation Paper • 2511.17290 • Published Nov 21, 2025 • 1
🇪🇪 Estonian LLM Evaluation Collection A collection of resources for evaluation of LLM capabilities in the Estonian language. • 33 items • Updated Dec 13, 2025 • 5
Multilingual Benchmarks Collection Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets (ACL 2026) • 29 items • Updated Apr 11 • 3
Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets Paper • 2602.22207 • Published Feb 25 • 44
Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 burtenshaw, SaylorTwift, kramp, merve, davanstrien, nielsr, julien-c • Feb 4 • 90
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published Feb 1 • 45
Jupyter Agent Collection Blog: https://huggingface.co/blog/jupyter-agent-2 • 4 items • Updated Sep 12, 2025 • 3
MamayLM-v1.0-Gemma-3 Collection First Open and Multimodal Ukrainian-focused LLM • 5 items • Updated Oct 8, 2025 • 21
Article Smol2Operator: Post-Training GUI Agents for Computer Use +3 A-Mahla, merve, sergiopaniego, reach-vb, lewtun • Sep 23, 2025 • 138
Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez • Sep 11, 2025 • 188
Article Jupyter Agents: training LLMs to reason with notebooks +1 baptistecolle, hannayukhymenko, lvwerra • Sep 10, 2025 • 65
Article mmBERT: ModernBERT goes Multilingual +4 mmarone, orionweller, will-fleshman, eugene-yang, dlawrie, vandurme • Sep 9, 2025 • 147
Apertus LLM Collection Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1, 2025 • 350
Article Announcing UA-Code-Bench: a New Benchmark for Evaluating LLMs on Competitive Programming Tasks in Ukrainian anon-researcher-ua • Jul 12, 2025 • 2
Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 smohammadi, siro1, winglian, marcsun13, djsaunde • Aug 8, 2025 • 98