danish-foundation-models/ai-arenaen-conversations • 2.9k • 20
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 49
Dynaword: From One-shot to Continuously Developed Datasets Paper • 2508.02271 • Published Aug 4, 2025 • 15
DeToNATION: Decoupled Torch Network-Aware Training on Interlinked Online Nodes Paper • 2502.06728 • Published Feb 10, 2025
Guarded Query Routing for Large Language Models Paper • 2505.14524 • Published May 20, 2025 • 2
HUME: Measuring the Human-Model Performance Gap in Text Embedding Tasks Paper • 2510.10062 • Published Oct 11, 2025 • 10
Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models? Paper • 2502.11895 • Published Feb 17, 2025 • 3