YCWTG/Qwen3-Coder-30B-A3B-Instruct-int4-mixed-AutoRound

Text Generation

Mixture of Experts

4-bit precision

YCWTG committed on Feb 27

Commit a11e67e (verified) · 1 parent: 75df2b1

Update README.md

Files changed (1)

README.md +3 -2

@@ -4,7 +4,7 @@ base_model:
 tags:
   - qwen3
   - moe
-  - int3
+  - int4
   - quantized
   - autoround
 license: apache-2.0
@@ -268,6 +268,7 @@ Users (both direct and downstream) should be made aware of the risks, biases and
 Here are a couple of useful links to learn more about Intel's AI software:
 - [Intel Neural Compressor](https://github.com/intel/neural-compressor)
+- [AutoRound](https://github.com/intel/auto-round)
 ## Disclaimer
@@ -277,4 +278,4 @@ The license on this model does not constitute legal advice. We are not responsib
 @article{cheng2023optimize, title={Optimize weight rounding via signed gradient descent for the quantization of llms}, author={Cheng, Wenhua and Zhang, Weiwei and Shen, Haihao and Cai, Yiyang and He, Xin and Lv, Kaokao and Liu, Yi}, journal={arXiv preprint arXiv:2309.05516}, year={2023} }
-[arxiv](https://arxiv.org/abs/2309.05516) [github](https://github.com/intel/auto-round)
+[arxiv](https://arxiv.org/abs/2309.05516)