ISTA-DASLab/DeepSeek-R1-0528-GPTQ-4b-128g-experts


ekurtic committed on Jun 2, 2025

Commit ccf4dbe (verified) · 1 Parent(s): c5e4d60

Update README.md

Files changed (1)

README.md +6 -2

@@ -23,9 +23,13 @@ Model outputs were generated with the vLLM engine.
 For reasoning tasks we estimate pass@1 based on 10 runs with different seeds and `temperature=0.6`, `top_p=0.95` and `max_new_tokens=65536`.
-#### Reasoning tasks (AIME-24, GPQA-Diamond, MATH-500)
-... coming soon ...
+|                             | Recovery (%) | deepseek/DeepSeek-R1-0528 | ISTA-DASLab/DeepSeek-R1-0528-GPTQ-4b-128g-experts<br>(this model) |
+| --------------------------- | :----------: | :------------------: | :--------------------------------------------------: |
+| AIME 2024<br>pass@1         | 98.50         | 88.66                | 87.33                                                |
+| MATH-500<br>pass@1          | 99.88        | 97.52                | 97.40                                                |
+| GPQA Diamond<br>pass@1      | 101.21        | 79.65                | 80.61                                                |
+| **Reasoning<br>Average Score**  | **99.82**        | **88.61**                | **88.45**                                                |
 ## Contributors
 Denis Kuznedelev (Yandex), Eldar Kurtić (Red Hat AI & ISTA), and Dan Alistarh (Red Hat AI & ISTA).
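
The README does not define the "Recovery (%)" column added in this commit. A minimal sketch, assuming recovery means the quantized model's score divided by the baseline score times 100, reproduces the values in the table. The names `scores` and `recovery` are hypothetical and chosen for illustration; the numbers are copied from the added table rows.

```python
# pass@1 scores copied from the table added in this commit:
# task -> (baseline DeepSeek-R1-0528, this GPTQ model)
scores = {
    "AIME 2024": (88.66, 87.33),
    "MATH-500": (97.52, 97.40),
    "GPQA Diamond": (79.65, 80.61),
}


def recovery(baseline: float, quantized: float) -> float:
    """Assumed definition: quantized score as a percentage of the baseline."""
    return 100.0 * quantized / baseline


# Per-task recovery, rounded to two decimals as in the table.
per_task = {task: round(recovery(b, q), 2) for task, (b, q) in scores.items()}

# Average scores per model, then the recovery of the averages.
baseline_avg = sum(b for b, _ in scores.values()) / len(scores)
quantized_avg = sum(q for _, q in scores.values()) / len(scores)
avg_recovery = round(recovery(baseline_avg, quantized_avg), 2)

for task, value in per_task.items():
    print(f"{task}: {value:.2f}")
print(f"Average: {avg_recovery:.2f}")
```

Under this assumption, the reported 99.82 in the average row matches the ratio of the two average scores; taking the mean of the three per-task recoveries instead would give roughly 99.86.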