Update README.md
README.md CHANGED

@@ -145,3 +145,18 @@ This model was evaluated on the well-known text benchmarks using [lm-evaluation-
### Accuracy

| Benchmark | sarvamai/sarvam-30b | RedHatAI/sarvam-30b-FP8-Dynamic | Recovery (%) |
|---|---|---|---|
| BBH (exact_match) | 63.32 | 62.95 | 99.42% |
| GSM8K (strict-match) | 72.33 | 72.40 | 100.10% |
| GSM8K (flexible-extract) | 69.67 | 70.81 | 101.63% |
| IFEval (inst_level_strict_acc) | 34.17 | 31.65 | 92.63% |
| MMLU-Pro (exact_match) | 45.69 | 45.81 | 100.25% |
| ARC-Challenge (acc) | 58.28 | 57.76 | 99.12% |
| HellaSwag (acc) | 53.98 | 53.98 | 100.00% |
| MMLU (acc) | 66.20 | 66.15 | 99.92% |
| TruthfulQA MC2 (acc) | 50.34 | 50.58 | 100.48% |
| Winogrande (acc) | 61.09 | 61.17 | 100.13% |
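
Recovery is the quantized model's score expressed as a percentage of the baseline score; for example, on BBH: 62.95 / 63.32 × 100 ≈ 99.42%.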
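
Scores like these can be reproduced with lm-evaluation-harness. The snippet below is a minimal sketch via the harness's Python API, not the exact command behind the table: the choice of task (GSM8K), the five-shot setting, and the vLLM backend are all illustrative assumptions.

```python
# Minimal sketch: re-scoring one benchmark from the table with
# lm-evaluation-harness (pip install lm-eval). Task name, few-shot
# count, and the vLLM backend are illustrative assumptions, not the
# exact evaluation settings behind the numbers above.
import lm_eval

results = lm_eval.simple_evaluate(
    model="vllm",  # or "hf" for a plain transformers backend
    model_args="pretrained=RedHatAI/sarvam-30b-FP8-Dynamic",
    tasks=["gsm8k"],   # one of the benchmarks reported above
    num_fewshot=5,     # assumed; check the harness task defaults
    batch_size="auto",
)
print(results["results"]["gsm8k"])
```

Swapping `pretrained=` to `sarvamai/sarvam-30b` and re-running gives the baseline column, from which the recovery percentage follows directly.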