ISTA-DASLab
/

DeepSeek-R1-GPTQ-4b-128g-experts

Text Generation

text-generation-inference

compressed-tensors

Model card Files Files and versions

ekurtic commited on Apr 8, 2025

Commit

16791da

·

verified ·

1 Parent(s): 42d1399

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -21,7 +21,7 @@ This model was evaluated on the OpenLLM v1 benchmarks and reasoning tasks (AIME-
 Model outputs were generated with the vLLM engine.
-For reasoning tasks we sample 10 solutions for each seed with `temperature=0.6`, `top_p=0.95` and `max_new_tokens=32768`.
 `OpenLLM v1 `

 Model outputs were generated with the vLLM engine.
+For reasoning tasks we estimate pass@1 based on 10 runs with different seeds and `temperature=0.6`, `top_p=0.95` and `max_new_tokens=32768`.
 `OpenLLM v1 `