view article Article Beyond LoRA: Can you beat the most popular fine-tuning technique? +2 BenjaminB, sayakpaul, hubnemo, kashif • 8 days ago • 62
Training Sparse Mixture Of Experts Text Embedding Models Paper • 2502.07972 • Published Feb 11, 2025 • 12
Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models Paper • 2501.14818 • Published Jan 20, 2025 • 10
view article Article Party is over: regularizing ColBERT models to fix efficient ANN methods lightonai • 10 days ago • 23
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking Paper • 2405.07920 • Published May 13, 2024 • 4
F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World Paper • 2603.19223 • Published Mar 19 • 35
Is Position Bias in Dense Retrievers Built In-or Learned from Data? Paper • 2605.26578 • Published May 26 • 20
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published May 9 • 82
view article Article How to Ground a Korean AI Agent in Real Demographics with Synthetic Personas nvidia • Apr 21 • 26
view article Article DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models lightonai • Apr 21 • 42
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 10 items • Updated about 1 month ago • 100
view article Article ATE-2: State-of-the-Art Armenian Text Embeddings and the ArmBench-TextEmbed Benchmark Metric-AI • Mar 19 • 9
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published Mar 12 • 65
Qwen3.5-text-only Collection Text-only versions of Qwen-3.5 without the vision encoders for a smaller memory and storage footprint. • 4 items • Updated 21 days ago • 15
zELO: ELO-inspired Training Method for Rerankers and Embedding Models Paper • 2509.12541 • Published Sep 16, 2025 • 11