Provence: efficient and robust context pruning for retrieval-augmented generation • nadiinchi • Jan 28, 2025 • 26
Fine-tuning LLMs to 1.58bit: extreme quantization made easy • medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf • Sep 18, 2024 • 280
From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease • muellerzr • Oct 21, 2022 • 44
Accelerating Document AI • rajistics, nielsr, florentgbelidji, nbroad • Nov 21, 2022 • 80