MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encodings Paper • 2405.19504 • Published May 29, 2024 • 4
AutoAgent: Evolving Cognition and Elastic Memory Orchestration for Adaptive Agents Paper • 2603.09716 • Published Mar 10 • 1
OpenThinker-Agent-Complete Collection OpenThinkerAgent-32B SFT data-scaling ladder (models + matching datasets, 316->100K) plus TaskTrove & AgentTrove sources. • 15 items • Updated 15 days ago • 4
view article Article Party is over: regularizing ColBERT models to fix efficient ANN methods lightonai • 9 days ago • 23
Accurate Chemistry Collection: Coupled cluster atomization energies for broad chemical space Paper • 2506.14492 • Published Jun 17, 2025 • 5
Accurate and scalable exchange-correlation with deep learning Paper • 2506.14665 • Published Apr 21 • 6
Skala Collection Accurate and scalable exchange-correlation with deep learning • 6 items • Updated Apr 29 • 5
BugPilot: Complex Bug Generation for Efficient Learning of SWE Skills Paper • 2510.19898 • Published Oct 22, 2025 • 4
SWE-FastContext Collection A family of code-search models powering the Explore subagent for coding agents. • 3 items • Updated 8 days ago • 15
Querit-Reranker: Training Compact Multilingual Rerankers via Efficient Label-Free Distribution Adaptation Paper • 2606.19037 • Published 9 days ago • 2
CycliST: A Video Language Model Benchmark for Reasoning on Cyclical State Transitions Paper • 2512.01095 • Published Nov 30, 2025 • 1
OpenFARM Collection Toward expert models for Animal Welfare Assessment • 9 items • Updated May 18 • 1
Reasoning Datasets Collection Synthetic datasets generated using reasoning models, primarily the Deepseek-R1 and Deepseek-V3 series. • 16 items • Updated 4 days ago • 6
SpreadsheetArena: Decomposing Preference in LLM Generation of Spreadsheet Workbooks Paper • 2603.10002 • Published Feb 16 • 2
MyPCBench: A Benchmark for Personally Intelligent Computer-Use Agents Paper • 2606.16748 • Published 11 days ago • 6
Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition Paper • 2305.05084 • Published May 8, 2023 • 6
Healthsheet: Development of a Transparency Artifact for Health Datasets Paper • 2202.13028 • Published Feb 26, 2022 • 2