Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 7 days ago • 16
view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate Jun 13 • 45