BoruiXu commited on
Commit
928f115
·
verified ·
1 Parent(s): 3bd77ad

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -1,6 +1,6 @@
1
  run phi3-mini on AMD NPU
2
 
3
- 1. If no ```phi3_mini_awq_4bit_no_flash_attention.pt```, use awq quantization for the original model.
4
  2. Put modeling_phi3.py in this repo into the phi-3-mini folder.
5
  3. Modify the file path in the run_awq.py
6
  4. run ```python run_awq.py --task decode --target aie --w_bit 4```
 
1
  run phi3-mini on AMD NPU
2
 
3
+ 1. If no ```phi3_mini_awq_4bit_no_flash_attention.pt```, use awq quantization to get the quantization model.
4
  2. Put modeling_phi3.py in this repo into the phi-3-mini folder.
5
  3. Modify the file path in the run_awq.py
6
  4. run ```python run_awq.py --task decode --target aie --w_bit 4```