Search for Coverage: Learning Coverage-Aware Retrieval with Augmented Sub-Question Answerability Paper • 2605.28522 • Published about 1 month ago • 2
view article Article MTEB Leaderboard: From a slow demo to feature-rich leaderboard Samoed • 14 days ago • 22
MILCO: Multilingual Learned Sparse Retrieval Collection MILCO maps queries and documents from different languages into a shared English lexical space via a multilingual connector. • 4 items • Updated May 15 • 4
Milco: Learned Sparse Retrieval Across Languages via a Multilingual Connector Paper • 2510.00671 • Published Oct 1, 2025 • 3
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 167
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 2 days ago • 21
DenseOn & LateOn Collection A collection of open state-of-the-art single and multi-vector models • 8 items • Updated 2 days ago • 12
ChatR1: Reinforcement Learning for Conversational Reasoning and Retrieval Augmented Question Answering Paper • 2510.13312 • Published Oct 15, 2025 • 2
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 36 items • Updated 11 days ago • 222
view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers tomaarsen, arthurbresnu • Jul 1, 2025 • 138
On the Challenges and Opportunities of Learned Sparse Retrieval for Code Paper • 2603.22008 • Published Mar 23 • 4
view article Article PISCO-OSCAR: embeddings for efficient Retrieval-Augmented Generation maxoul • Jun 18, 2025 • 3
QReCC Collection QReCC (Question Rewriting in Conversational Context) for passage retrieval and QA • 2 items • Updated Aug 25, 2025 • 1
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 447