### Evaluation
This model was evaluated on the OpenLLM v1 benchmarks and on the reasoning tasks AIME24, GPQA-Diamond, and MATH500. Model outputs were generated with the vLLM engine.
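As a rough sketch, an evaluation like this can be reproduced with lm-evaluation-harness using its vLLM backend; the exact `model_args` (tensor parallelism, context length, batch size) are assumptions that depend on the available hardware.

```shell
# Hypothetical invocation: OpenLLM v1 tasks via lm-evaluation-harness
# with vLLM as the generation backend. tensor_parallel_size=8 is an
# assumption, not a value stated by the model card.
lm_eval --model vllm \
  --model_args pretrained=ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts,tensor_parallel_size=8 \
  --tasks arc_challenge,gsm8k,hellaswag,mmlu,truthfulqa_mc2,winogrande \
  --batch_size auto
```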
| Model | ARC-C | GSM8K | HellaSwag | MMLU | TruthfulQA-mc2 | Winogrande | Average | Recovery (%) |
|-------|-------|-------|-----------|------|----------------|------------|---------|--------------|
| deepseek-ai/DeepSeek-R1 | 72.53 | 95.91 | 89.83 | 87.22 | 59.28 | 82.00 | 81.04 | 100.00 |
| cognitivecomputations/DeepSeek-R1-AWQ | 73.12 | 95.15 | 89.07 | 86.86 | 60.09 | 82.32 | 81.10 | 100.07 |
| ISTA-DASLab/DeepSeek-R1-GPTQ-4b-128g-experts (this model) | 72.53 | 95.68 | 89.36 | 86.99 | 59.77 | 83.35 | 81.28 | 100.30 |
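The recovery column is the quantized model's benchmark average expressed as a percentage of the unquantized baseline's average. A minimal sketch, using the averages reported in the table:

```python
# Recovery = 100 * quantized average / baseline average,
# with the per-model averages taken from the table above.
def recovery(quantized_avg: float, baseline_avg: float) -> float:
    """Return average-score recovery in percent, rounded to 2 decimals."""
    return round(100 * quantized_avg / baseline_avg, 2)

baseline = 81.04  # deepseek-ai/DeepSeek-R1 average
print(recovery(81.10, baseline))  # AWQ checkpoint  -> 100.07
print(recovery(81.28, baseline))  # this checkpoint -> 100.3
```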