Zyphra
/

Zamba2-7B-Instruct

Text Generation

Model card Files Files and versions

BerenMillidge commited on Oct 11, 2024

Commit

eec7dd4

·

verified ·

1 Parent(s): 31be28f

Update README.md

Files changed (1) hide show

README.md +13 -1

README.md CHANGED Viewed

@@ -56,7 +56,19 @@ model = AutoModelForCausalLM.from_pretrained("Zyphra/Zamba2-7B", device_map="cud
 Zamba2-7B-Instruct punches dramatically above its weight, achieving extremely strong instruction-following benchmark scores.
-TODO
 Moreover, due to its unique hybrid SSM architecture, Zamba2-7B-Instruct achieves extremely low inference latency and rapid generation with a significantly smaller memory footprint than comparable transformer-based models.

 Zamba2-7B-Instruct punches dramatically above its weight, achieving extremely strong instruction-following benchmark scores.
+<div style="width: 50%; margin: auto;">
+| Task         | Score |
+|:------------:|:---------:|
+| IFEval       | 69.95 |
+| BBH          | 33.33 |
+| MATH Lvl 5   | 13.57 |
+| GPQA         | 10.28 |
+| MUSR         |  8.21 |
+| MMLU-PRO     | 32.43 |
+| **Average**      | **27.96** |
+</div>
 Moreover, due to its unique hybrid SSM architecture, Zamba2-7B-Instruct achieves extremely low inference latency and rapid generation with a significantly smaller memory footprint than comparable transformer-based models.