Article • Introducing North Mini Code: Cohere’s First Model For Developers • CohereLabs • 17 days ago • 75
Reward Bench Leaderboard 📐 • Running • Agents • 432 • Explore and compare model scores on RewardBench benchmarks
Article • The Transformers Library: standardizing model definitions • lysandre, ArthurZ, pcuenq, julien-c, +2 • May 15, 2025 • 123
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16, 2025 • 170
Article • MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before • isaacchung • Apr 24, 2025 • 18
smolagents LLM leaderboard 🏆 • Runtime error • Featured • 142 • A leaderboard for LLMs powering smolagents
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking Paper • 2405.07920 • Published May 13, 2024 • 4