view article Article SFT with vLLM Downstream Evaluation: A VRAM-Efficient Pipeline (arm64) AlioLeuchtmann • Jan 11 • 3
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective LinkedIn • Jan 27 • 80
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL +4 toslali-ibm, mirinflim, qgallouedec, esnible, rganti, mudhakar • Jun 3, 2025 • 101
view article Article Fixing Gradient Accumulation +4 lysandre, ArthurZ, muellerzr, ydshieh, BenjaminB, pcuenq • Oct 16, 2024 • 66
view article Article Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2 +4 RQlee, ArthurZ, achikundu, lwtr, rganti, mayank-mishra • Aug 21, 2024 • 41
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention sirluk • Oct 7, 2024 • 71
view article Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs davidberenstein1957 • May 7, 2025 • 42
view article Article Saving Memory Using Padding-Free Transformer Layers during Finetuning mayank-mishra • Jun 11, 2024 • 21