GTE models Collection General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 20 items • Updated Mar 2 • 36
Open LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 50 items • Updated 22 days ago • 678
SWE-bench Collection SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability resolve real world GitHub Issues. • 4 items • Updated Mar 8, 2025 • 9
Running on CPU Upgrade 596 GAIA Leaderboard 🦾 596 Submit your model answers to GAIA benchmark and view leaderboard