Update README.md
README.md
CHANGED
@@ -61,15 +61,6 @@ TODO
 
 Moreover, due to its unique hybrid SSM architecture, Zamba2-7B-Instruct achieves extremely low inference latency and rapid generation with a significantly smaller memory footprint than comparable transformer-based models.
 
-<center>
-<img src="https://cdn-uploads.huggingface.co/production/uploads/65c05e75c084467acab2f84a/nHM8bX0y8SWa4zwMSbBi7.png" width="500" alt="Zamba architecture">
-</center>
-
-<center>
-<img src="https://cdn-uploads.huggingface.co/production/uploads/65c05e75c084467acab2f84a/qXG8aip6h77LHKjhWfjD5.png" width="500" alt="Zamba architecture">
-</center>
-
-
 
 Time to First Token (TTFT) | Output Generation
 :-------------------------:|:-------------------------:
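For context (not part of the diff itself), below is a minimal sketch of how one might measure the Time to First Token and output-generation speed referenced by the table headers above. It assumes the Hugging Face repo id `Zyphra/Zamba2-7B-Instruct`, a transformers build with Zamba2 support, and a CUDA GPU; the prompt and token counts are illustrative only.

```python
# Hypothetical timing sketch, not the benchmark used for the README plots.
import time

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Zyphra/Zamba2-7B-Instruct"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16
).to("cuda")  # assumes a CUDA GPU is available

prompt = "Explain state-space models in one paragraph."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Time to First Token: wall-clock time to produce a single new token.
start = time.perf_counter()
model.generate(**inputs, max_new_tokens=1)
ttft = time.perf_counter() - start

# Output generation: tokens per second over a longer completion.
start = time.perf_counter()
out = model.generate(**inputs, max_new_tokens=256)
elapsed = time.perf_counter() - start
new_tokens = out.shape[1] - inputs["input_ids"].shape[1]

print(f"TTFT: {ttft:.3f}s, generation: {new_tokens / elapsed:.1f} tokens/s")
```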