Running on CPU Upgrade Agents 76 AIR-Bench Leaderboard 🥇 76 Explore and compare QA and long doc benchmarks
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement Paper • 2606.11926 • Published 19 days ago • 120
hanhainebula/reason-embed-qwen3-4b-0928 Feature Extraction • 4B • Updated 20 days ago • 443 • 4
hanhainebula/reason-embed-qwen3-8b-0928 Feature Extraction • 8B • Updated 20 days ago • 149 • 2
ReasonEmbed Collection ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval • 14 items • Updated 20 days ago • 2
hanhainebula/reason-embed-annotator-qwen3-8b-0928 Text Generation • 8B • Updated 20 days ago • 14
hanhainebula/reason-embed-annotator-qwen3-8b-0928 Text Generation • 8B • Updated 20 days ago • 14
AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery Paper • 2604.25256 • Published Apr 28 • 30