Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 7 items • Updated 11 days ago • 24
view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate +2 Jun 13, 2024 • 62