Article Beyond LoRA: Can you beat the most popular fine-tuning technique? +2 BenjaminB, sayakpaul, hubnemo, kashif • 7 days ago • 60
Training Sparse Mixture Of Experts Text Embedding Models Paper • 2502.07972 • Published Feb 11, 2025 • 12
Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models Paper • 2501.14818 • Published Jan 20, 2025 • 10
VAGOsolutions/SauerkrautLM-Multi-Reason-ModernColBERT Sentence Similarity • 0.1B • Updated Aug 3, 2025 • 24.8k • 15
Article Party is over: regularizing ColBERT models to fix efficient ANN methods lightonai • 8 days ago • 23