view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 6 days ago • 52
view article Article Nano-BEIR: A Multilingual Information Retrieval Benchmark with Quality-Enhanced Queries Dec 22, 2025 • 8
view article Article Introducing Daggr: Chain apps programmatically, inspect visually +3 12 days ago • 94
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective 14 days ago • 51
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 119
view article Article RexRerankers: SOTA Rankers for Product Discovery and AI Assistants 17 days ago • 44
Embedding Models Collection Run or fine-tune embedding models with Unsloth. • 14 items • Updated 6 days ago • 3
view article Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family 21 days ago • 77
Falcon-H1-Tiny Collection A series of extremely small, yet powerful language models redefining capabilities at small scale • 22 items • Updated 25 days ago • 35
view article Article How We Built a Semantic Highlight Model To Save Token Cost for RAG 26 days ago • 64
Towards General Text Embeddings with Multi-stage Contrastive Learning Paper • 2308.03281 • Published Aug 7, 2023 • 3
TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval Paper • 2511.16528 • Published Nov 20, 2025 • 24
BERT Hash Nano Models Collection Set of BERT models with a modified embeddings layer • 8 items • Updated 12 days ago • 9