LettuceDetect: A Hallucination Detection Framework for RAG Applications Paper • 2502.17125 • Published Feb 24, 2025 • 14
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 21 days ago • 69
view article Article Multimodal Embedding & Reranker Models with Sentence Transformers 28 days ago • 57
Squeez: Task-Conditioned Tool-Output Pruning for Coding Agents Paper • 2604.04979 • Published Apr 4 • 10
view article Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval +1 Mar 22, 2024 • 131
When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA Paper • 2510.04849 • Published Oct 6, 2025 • 117
Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report Paper • 2510.14880 • Published Oct 16, 2025 • 19
view article Article Granite Embedding R2: Setting New Standards for Enterprise Retrieval Oct 14, 2025 • 16
mmBERT: A Modern Multilingual Encoder with Annealed Language Learning Paper • 2509.06888 • Published Sep 8, 2025 • 15
view article Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 • 273
Medical and Scientific Literature Models Collection Models for working with medical and scientific literature. • 17 items • Updated 8 days ago • 12
Hallucination detection Collection Trained ModernBERT (base and large) for detection hallucinations in LLM responses. The models are trained as token classifications. • 4 items • Updated May 18, 2025 • 19
view article Article Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications Aug 29, 2025 • 27
TinyLettuce Collection This Collection contains our small, Ettin-encoder (https://arxiv.org/abs/2507.11412) based models trained on synthetic and RagTruth data. • 6 items • Updated Aug 31, 2025 • 4