BoruiXu/phi3_mini_amd_NPU

BoruiXu committed on Jul 4, 2024

Commit

928f115

1 Parent(s): 3bd77ad

Update README.md

Files changed (1)

README.md +1 -1

@@ -1,6 +1,6 @@
 run phi3-mini on AMD NPU
-1. If no ```phi3_mini_awq_4bit_no_flash_attention.pt```, use awq quantization for the original model.
 2. Put modeling_phi3.py in this repo into the phi-3-mini folder.
 3. Modify the file path in the run_awq.py
 4. run ```python run_awq.py --task decode --target aie --w_bit 4```

 run phi3-mini on AMD NPU
+1. If no ```phi3_mini_awq_4bit_no_flash_attention.pt```, use awq quantization to get the quantization model.
 2. Put modeling_phi3.py in this repo into the phi-3-mini folder.
 3. Modify the file path in the run_awq.py
 4. run ```python run_awq.py --task decode --target aie --w_bit 4```
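
As a rough illustration of steps 1 and 4 in the README, the following Python sketch checks that the quantized checkpoint exists before assembling the decode command. The helper name `build_decode_command` and the `ckpt_dir` argument are hypothetical and are not part of this repo. Only the checkpoint filename and the `run_awq.py` flags are taken from the README, and where `run_awq.py` actually looks for the checkpoint is an assumption.

```python
from pathlib import Path

# Checkpoint filename as given in step 1 of the README.
CKPT_NAME = "phi3_mini_awq_4bit_no_flash_attention.pt"


def build_decode_command(ckpt_dir):
    """Return the step-4 command, or raise if the quantized checkpoint is missing.

    `ckpt_dir` is a hypothetical argument: the directory assumed to hold the
    checkpoint. The README does not say where run_awq.py expects it.
    """
    ckpt = Path(ckpt_dir) / CKPT_NAME
    if not ckpt.is_file():
        # Step 1: the model must be AWQ-quantized first to produce this file.
        raise FileNotFoundError(
            f"{ckpt} not found; run AWQ quantization to produce it first"
        )
    # Step 4: flags copied verbatim from the README.
    return ["python", "run_awq.py", "--task", "decode", "--target", "aie", "--w_bit", "4"]
```

The returned list could be passed to `subprocess.run(cmd, check=True)` from the directory that contains `run_awq.py`, after steps 2 and 3 have been done by hand.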