Improve model card: add pipeline tag, library name, update license, and paper reference
#1 opened by nielsr (HF Staff)

README.md CHANGED
@@ -1,16 +1,20 @@
 ---
-datasets:
-- NeelNanda/pile-10k
 base_model:
 - deepseek-ai/DeepSeek-R1
+datasets:
+- NeelNanda/pile-10k
+pipeline_tag: text-generation
+library_name: transformers
+license: apache-2.0
+---
 
+This model is an int2 model with group_size 64 and symmetric quantization of [deepseek-ai/DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1), generated by the **SignRoundV2** algorithm described in the paper [SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs](https://huggingface.co/papers/2512.04746).
 
-
----
+For more details on the AutoRound project and its implementation, see the [GitHub repository](https://github.com/intel/auto-round).
 
 ## Model Details
 
-
+Some layers fall back to 4/16 bits. Refer to the "Generate the model" section for more details on the mixed-bit settings.
 
 Please follow the license of the original model. This model can **NOT** run on other serving frameworks.
 
@@ -439,6 +443,13 @@ The license on this model does not constitute legal advice. We are not responsib
 
 ## Cite
 
-
+```bibtex
+@article{cheng2025signroundv2,
+  title={SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs},
+  author={Cheng, Wenhua and Zhang, Weiwei and Guo, Heng and Shen, Haihao},
+  journal={arXiv preprint arXiv:2512.04746},
+  year={2025}
+}
+```
 
-[arxiv](https://arxiv.org/abs/
+[arxiv](https://arxiv.org/abs/2512.04746)
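
The new `pipeline_tag: text-generation` and `library_name: transformers` metadata point at the standard transformers loading path. Below is a minimal sketch of that usage, assuming the int2 checkpoint loads through `AutoModelForCausalLM` with the `auto-round` package installed; the repo id is a placeholder, not this repository's actual id, and the card's note that the model cannot run on other serving frameworks still applies:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder id: replace with this repository's actual model id.
model_id = "path/to/DeepSeek-R1-int2-sym"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",       # spread the (still very large) model across available devices
    torch_dtype="auto",      # keep the dtypes stored in the quantized checkpoint
    trust_remote_code=True,  # DeepSeek-R1 ships custom modeling code
)

prompt = "Explain int2 weight-only quantization in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```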
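
For the mixed-bit note added under "Model Details", here is a hedged sketch of how such a recipe is typically expressed with AutoRound. The layer names are illustrative, and argument names such as `layer_config` and `quantize_and_save` follow recent auto-round releases and may differ across versions; the exact recipe used for this checkpoint is the one in the card's "Generate the model" section:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from auto_round import AutoRound  # assumes the auto-round package is installed

model_name = "deepseek-ai/DeepSeek-R1"
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)

# Illustrative per-layer overrides: most weights go to int2 (group_size 64, symmetric),
# while a few sensitive layers stay at 4 or 16 bits. These are example names,
# not the exact fallback list used for this checkpoint.
layer_config = {
    "lm_head": {"bits": 16},
    "model.layers.0.self_attn.q_proj": {"bits": 4},
}

autoround = AutoRound(
    model,
    tokenizer,
    bits=2,          # int2 weight-only quantization
    group_size=64,   # as stated in the model card
    sym=True,        # symmetric quantization
    layer_config=layer_config,
)
autoround.quantize_and_save("DeepSeek-R1-int2-sym")
```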
|