Making Large Language Models Efficient Dense Retrievers Paper • 2512.20612 • Published 3 days ago • 2
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts Paper • 2503.05066 • Published Mar 7 • 4