From Distillation to Hard Negative Sampling: Making Sparse Neural IR Models More Effective Paper • 2205.04733 • Published May 10, 2022 • 2
opensearch-project/opensearch-neural-sparse-encoding-doc-v2-distill Feature Extraction • 67M • Updated Jun 30, 2025 • 1.97M • • 19
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Paper • 1810.04805 • Published Oct 11, 2018 • 25
One Embedder, Any Task: Instruction-Finetuned Text Embeddings Paper • 2212.09741 • Published Dec 19, 2022 • 4
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023 • 82
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data Paper • 2502.08468 • Published Feb 12, 2025 • 16
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated Mar 6, 2025 • 150M • • 4.38k
GooAQ: Open Question Answering with Diverse Answer Types Paper • 2104.08727 • Published Apr 18, 2021 • 1
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 410
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 705