Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published Mar 16 • 155
GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning Paper • 2505.20355 • Published May 26, 2025 • 37
Article Mixture of Experts Explained +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.15k
QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference Paper • 2402.10076 • Published Feb 15, 2024 • 2