Article CUGA on Hugging Face: Democratizing Configurable AI Agents ibm-research • Dec 15, 2025 • 67
Article Gaia2 and ARE: Empowering the community to study agents +9 clefourrier, gregmialz, mlcu, mortimerp9, XciD, tfrere, evijit, RomainFroger, dheeraj7596, CarolinePascal, upiter • Sep 22, 2025 • 134
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning Paper • 2509.13305 • Published Sep 16, 2025 • 91
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 15 items • Updated Mar 10 • 634
Article KV Caching Explained: Optimizing Transformer Inference Efficiency not-lain • Jan 30, 2025 • 329
Article Welcome to Inference Providers on the Hub 🔥 +5 burkaygur, zeke, aton2006, hassanelmghari, sbrandeis, kramp, julien-c • Jan 28, 2025 • 495
Marqo-Ecommerce-Embeddings Collection State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP. • 10 items • Updated Nov 14, 2024 • 18
Article Llama can now see and run on your device - welcome Llama 3.2 +5 merve, philschmid, osanseviero, reach-vb, lewtun, ariG23498, pcuenq • Sep 25, 2024 • 191
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Mar 12 • 89
Article WWDC 24: Running Mistral 7B with Core ML +2 pcuenq, FL33TW00D-HF, reach-vb, osanseviero • Jul 22, 2024 • 65
FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 42 items • Updated Mar 2 • 80
Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ xhluca • Jul 9, 2024 • 83
Article Training and Finetuning Embedding Models with Sentence Transformers tomaarsen • May 28, 2024 • 274