jingyux-nv commited on
Commit
48bfe38
·
verified ·
1 Parent(s): 56fbbe4

Fix benchmark note: remove inaccurate FP8 baseline line, correct max tokens to 384000

Browse files
Files changed (1) hide show
  1. README.md +1 -2
README.md CHANGED
@@ -166,8 +166,7 @@ The accuracy benchmark results are presented in the table below:
166
  </tr>
167
  </table>
168
 
169
- > Baseline (ours): our measurement of [DeepSeek-V4-Flash](https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash) in its native FP8 format.
170
- > Benchmarked with temperature=1.0, top_p=1.0, max num tokens 64000
171
 
172
  Model Limitations:
173
  The base model was trained on data that contains toxic language and societal biases originally crawled from the internet. Therefore, the model may amplify those biases and return toxic responses especially when prompted with toxic prompts. The model may generate answers that may be inaccurate, omit key information, or include irrelevant or redundant text producing socially unacceptable or undesirable text, even if the prompt itself does not include anything explicitly offensive.
 
166
  </tr>
167
  </table>
168
 
169
+ > Benchmarked with temperature=1.0, top_p=1.0, max num tokens 384000
 
170
 
171
  Model Limitations:
172
  The base model was trained on data that contains toxic language and societal biases originally crawled from the internet. Therefore, the model may amplify those biases and return toxic responses especially when prompted with toxic prompts. The model may generate answers that may be inaccurate, omit key information, or include irrelevant or redundant text producing socially unacceptable or undesirable text, even if the prompt itself does not include anything explicitly offensive.