view article Article Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel smangrul, sgugger • May 2, 2022 • 9
view article Article Fine-tuning Llama 2 70B using PyTorch FSDP +2 smangrul, sgugger, lewtun, philschmid • Sep 13, 2023 • 32