Native tensor parallel has landed in transformers! https://github.com/huggingface/transformers/pull/34184 Thanks a lot to the torch team for their support! Contributions are welcome to support more models! 🔥
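For readers new to the idea, here is a toy sketch of what column-wise tensor parallelism does to a single linear layer. It is plain Python, not the transformers or torch implementation from the PR. The helper names (`matmul`, `split_columns`, `column_parallel_linear`) are hypothetical and exist only for this illustration. Each weight shard would normally live on its own device, and the final concatenation stands in for an all-gather.

```python
# Toy illustration of column-wise tensor parallelism.
# Assumption: this is NOT how transformers implements it; all names are hypothetical.

def matmul(x, w):
    """Multiply an (n x k) matrix by a (k x m) matrix, both given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*w)] for row in x]

def split_columns(w, num_shards):
    """Split the columns of w into num_shards contiguous, equally sized blocks."""
    cols = len(w[0])
    if cols % num_shards != 0:
        raise ValueError("number of columns must be divisible by num_shards")
    step = cols // num_shards
    return [[row[i * step:(i + 1) * step] for row in w] for i in range(num_shards)]

def column_parallel_linear(x, w, num_shards):
    """Compute x @ w by giving each (simulated) device one column block of w."""
    shards = split_columns(w, num_shards)
    # Each partial product would run on a separate device in a real setup.
    partials = [matmul(x, shard) for shard in shards]
    # Stand-in for an all-gather: concatenate the partial outputs column-wise.
    return [sum((p[r] for p in partials), []) for r in range(len(x))]

x = [[1.0, 2.0], [3.0, 4.0]]
w = [[1.0, 0.0, 2.0, 1.0], [0.0, 1.0, 1.0, 3.0]]

full = matmul(x, w)
sharded = column_parallel_linear(x, w, num_shards=2)
assert full == sharded
```

The sharded result matches the unsharded one because each output column depends on only one column of the weight matrix, so the columns can be computed independently and stitched back together.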
Mamba is now available in transformers. Thanks to @tridao and @albertgu for this brilliant model 🚀 and for the amazing mamba-ssm kernels powering it! Check out the collection here: state-spaces/transformers-compatible-mamba-65e7b40ab87e5297e45ae406
Mamba
Mamba checkpoints compatible with transformers
ArthurZ/mamba-2.8b • Text Generation • Updated Mar 4 • 15 • 1
ArthurZ/mamba-2.8b-slimpj • Text Generation • Updated Feb 19 • 23
ArthurZ/mamba-1.4b • Text Generation • Updated Feb 29 • 16
ArthurZ/mamba-790m • Text Generation • Updated Feb 29 • 25