</p>
Language: [中文](https://huggingface.co/YCWTG/Qwen3-Coder-Next-int2-mixed-AutoRound/blob/main/README_zh.md) | English
## Model Details
This model is a **mixed-bit INT2 quantized** version of [Qwen/Qwen3-Coder-Next](https://huggingface.co/Qwen/Qwen3-Coder-Next), generated by [intel/auto-round](https://github.com/intel/auto-round) with group_size 512 and symmetric quantization. Please follow the license of the original model.
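As a rough illustration of what symmetric group-wise quantization means here (a toy sketch, not AutoRound's actual algorithm), each group of weights shares a single scale, and every weight is rounded to a small signed integer range:

```python
def quantize_group(weights, bits=2):
    """Toy symmetric per-group quantization: one shared scale per group,
    integers clamped to the signed range of the given bit width."""
    qmax = 2 ** (bits - 1) - 1       # 1 for INT2
    qmin = -(2 ** (bits - 1))        # -2 for INT2
    scale = max(abs(w) for w in weights) / qmax or 1.0  # guard against all-zero groups
    ints = [min(max(round(w / scale), qmin), qmax) for w in weights]
    dequantized = [q * scale for q in ints]
    return ints, scale, dequantized

ints, scale, deq = quantize_group([0.4, -0.9, 0.05, 0.31])
```

In the real model each group holds 512 weights, and AutoRound additionally tunes the rounding and keeps some layers (such as `lm_head` above) at higher precision, which is why the card says "mixed-bit".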
| lm_head | Original | Excluded by AutoRound |
### Model Size
- **Original BF16**: ~160 GB
- **Mixed INT2**: ~25 GB (**~84% smaller**)
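The quoted saving follows directly from the two sizes above:

```python
original_gb, quantized_gb = 160, 25
reduction = 1 - quantized_gb / original_gb  # 0.84375
print(f"~{reduction:.0%} smaller")
```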
if __name__ == "__main__":
    model, tokenizer = load_model()
    chat_loop(model, tokenizer)
```
)
output_dir = "~/.cache/model/Qwen3-Coder-Next-int2-mixed-AutoRound"
autoround.quantize_and_save(output_dir, format="auto_round")
```
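A back-of-the-envelope estimate (assuming one 16-bit scale per group of 512 weights; the real on-disk format stores extra metadata and mixes bit widths across layers) shows why group_size 512 adds almost no overhead on top of the 2-bit payload:

```python
bits, group_size, scale_bits = 2, 512, 16
bits_per_weight = bits + scale_bits / group_size
print(bits_per_weight)  # 2.03125 bits per weight, vs 16 for BF16
```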
## Ethical Considerations and Limitations