Update README.md
README.md CHANGED
@@ -10,6 +10,9 @@ This model is a mixed int4 model with group_size 128 and symmetric quantization
 Non expert layers are fallback to 8 bits. Please refer to Section Generate the model for more details.
 Please follow the license of the original model.
 
+**The `e_score_correction_bias` is stored in BF16** because, when loaded in Transformers, its dtype is automatically converted to BF16. As a result, it is difficult for us to preserve it in FP32 within our tools.
+Please use it with caution.
+
 ## How To Use
 
 ### INT4 Inference
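The BF16 note above can be verified directly after loading the checkpoint. Below is a minimal sketch, not part of this PR: it assumes the checkpoint loads through Transformers with `trust_remote_code`, and the repository ID is a placeholder, not the real model name. The parameter name `e_score_correction_bias` is taken from the README note.

```python
# Minimal sketch (assumption: the quantized checkpoint loads via Transformers).
# The repo ID below is a placeholder, not the actual model repository.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "your-org/your-int4-model",   # placeholder repository ID
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
)

# Transformers casts weights to the model dtype on load, which is why the
# README says e_score_correction_bias ends up in BF16 rather than FP32.
for name, tensor in model.state_dict().items():
    if "e_score_correction_bias" in name:
        print(name, tensor.dtype)
```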