view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels drbh, danieldk • Aug 18, 2025 • 97
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante • Aug 5, 2025 • 513
view article Article Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ +1 Wauplin, celinah, julien-c • Jul 25, 2025 • 84
view article Article Asynchronous Robot Inference: Decoupling Action Prediction and Execution +6 fracapuano, imstevenpmwork, aractingi, mshukor, danaaubakirova, AdilZtn, aliberts, cadene • Jul 10, 2025 • 54
view article Article ScreenEnv: Deploy your full stack Desktop Agent A-Mahla, m-ric • Jul 10, 2025 • 76
view article Article SmolLM3: smol, multilingual, long-context reasoner +21 eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 773
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders thomwolf, matthieu-lapeyre • Jul 9, 2025 • 796
view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers tomaarsen, arthurbresnu • Jul 1, 2025 • 138
view article Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy +4 medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf • Sep 18, 2024 • 280
view article Article Learn the Hugging Face Kernel Hub in 5 Minutes +5 drbh, danieldk, Narsil, pcuenq, pagezyhf, merve, reach-vb • Jun 12, 2025 • 164
view article Article 💥 Building a Vulnerable Bank MCP — Then Automating an Agent to Hack It jdelavande • Jun 18, 2025 • 8
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2, 2025 • 159
view changelog Hugging Face Changelog AI-generated Abstract summaries on Hugging Face Papers May 22, 2025 • 75
view changelog Hugging Face Changelog Xet is now the default storage option for new users and organizations May 23, 2025 • 76
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance tngtech • Apr 16, 2025 • 76
view article Article Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability sasha • May 7, 2025 • 17
view article Article Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. tiiuae • May 15, 2025 • 36
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference mfuntowicz, hlarcher • Jan 16, 2025 • 76