LightOnOCR-2 🦉 Collection LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 5 days ago • 18
view article Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family 7 days ago • 64
view article Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval +1 Mar 22, 2024 • 126
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling Paper • 2512.23959 • Published 28 days ago • 109
KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model Paper • 2506.20923 • Published Jun 26, 2025 • 10
view article Article TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval Dec 4, 2025 • 19
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 149
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 85
Ministral 3 - Additional Checkpoints Collection Different formats and Quantized versions of our Ministral 3 family; 14B/8B/3B Instruct/Reasoning GGUF, 3B Instruct ONNX and 14B/8B/3B Instruct BF16. • 13 items • Updated Dec 2, 2025 • 18
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 281
view article Article huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning +2 Oct 27, 2025 • 74