Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 88 items • Updated Mar 2 • 118
view article Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation +7 yuxiang630, cassanof, ganler, YifengDing, StringChaos, harmdevries, lvwerra, arjunguha, lingming • Apr 29, 2024 • 79
view article Article Hugging Face x LangChain : A new partner package +1 Jofthomas, kkondratenko, efriis • May 14, 2024 • 161
view article Article Announcing New Dataset Search Features +1 lhoestq, severo, kramp • Jul 8, 2024 • 23
view article Article SmolLM - blazingly fast and remarkably powerful +1 loubnabnl, anton-l, eliebak • Jul 16, 2024 • 455
view article Article FineVideo: behind the scenes +4 mfarre, andito, lewtun, lvwerra, pcuenq, thomwolf • Sep 23, 2024 • 35
view article Article Argilla 2.4: Easily Build Fine-Tuning and Evaluation Datasets on the Hub — No Code Required +1 nataliaElv, burtenshaw, dvilasuero • Nov 4, 2024 • 45
view article Article Judge Arena: Benchmarking LLMs as Evaluators +6 kaikaidai, MauriceBurg, RomanEngeler1805, mbartolo, clefourrier, tobydrane, mathias-atla, jacksongolden • Nov 19, 2024 • 63
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language +4 davidberenstein1957, sdiazlor, Leiyre, dvilasuero, Ameeeee, burtenshaw • Dec 16, 2024 • 158
view article Article Introducing smolagents: simple agents that write actions in code. +1 m-ric, merve, thomwolf • Dec 31, 2024 • 1.19k