Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • about 23 hours ago • 13
Article SSE Retrieval MRL v2: Regularization of Representation Space and Performance Improvement via Hyperparameter Optimization RikkaBotan • 1 day ago • 1
Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient? Paper • 2605.10848 • Published 4 days ago • 4
A Causal Language Modeling Detour Improves Encoder Continued Pretraining Paper • 2605.12438 • Published 3 days ago • 5
jina-embeddings-v5-omni Collection Multimodal (text + image + video + audio) embedding models aligned with jina-embeddings-v5-text-*. Two sizes, four task variants each. • 27 items • Updated 2 days ago • 29
Article The State of Arabic Multimodal Embedding — What a 2B Finetune Taught Us Omartificial-Intelligence-Space • 22 days ago • 3
Article EMO: Pretraining mixture of experts for emergent modularity allenai • 6 days ago • 33
EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval Paper • 2301.12005 • Published Jul 3, 2023 • 1
XTR Replicability Collection All the models used in experiments from "A Replicability Study of XTR" • 16 items • Updated 9 days ago • 6
Prism-Reranker: Beyond Relevance Scoring -- Jointly Producing Contributions and Evidence for Agentic Retrieval Paper • 2604.23734 • Published 19 days ago • 3
BidirLM-Embedding Collection BidirLM is a family of 5 frontier bidirectional encoders, including an omnimodal variant at 2.5B. • 6 items • Updated Apr 7 • 7
Embed Mamba2 Collection Text embedding models based on Mamba2 with linear-time and constant-memory inference through vertical chunking. • 5 items • Updated 24 days ago • 3
VISA: Retrieval Augmented Generation with Visual Source Attribution Paper • 2412.14457 • Published Dec 19, 2024 • 1