Update README.md
README.md
CHANGED
@@ -1,6 +1,6 @@
 run phi3-mini on AMD NPU
 
-1. If no ```phi3_mini_awq_4bit_no_flash_attention.pt```, use awq quantization
+1. If no ```phi3_mini_awq_4bit_no_flash_attention.pt```, use awq quantization to get the quantization model.
 2. Put modeling_phi3.py in this repo into the phi-3-mini folder.
 3. Modify the file path in the run_awq.py
 4. run ```python run_awq.py --task decode --target aie --w_bit 4```
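
The README steps can be sketched as a small pre-flight check. This is only an illustration, not part of the repo: the helper names `needs_quantization` and `decode_command` are hypothetical, and the checkpoint is assumed to sit in the current working directory, which the README does not state. The file name and the command-line flags are copied from steps 1 and 4.

```python
import shlex
from pathlib import Path

# File name from step 1 of the README; its location (current directory)
# is an assumption made for this sketch.
CHECKPOINT = Path("phi3_mini_awq_4bit_no_flash_attention.pt")


def needs_quantization(checkpoint: Path = CHECKPOINT) -> bool:
    """Return True when the quantized checkpoint is missing (step 1)."""
    return not checkpoint.is_file()


def decode_command() -> list[str]:
    """Build the step 4 command with the flags given in the README."""
    return ["python", "run_awq.py", "--task", "decode",
            "--target", "aie", "--w_bit", "4"]


if __name__ == "__main__":
    if needs_quantization():
        # Step 1: the checkpoint has to be produced by AWQ quantization first.
        print(f"{CHECKPOINT} not found: run AWQ quantization first.")
    else:
        # Step 4: print the command rather than executing it.
        print("Run:", shlex.join(decode_command()))
```

The sketch only prints the command instead of launching it, since steps 2 and 3 (copying modeling_phi3.py and editing the file path in run_awq.py) still have to be done by hand.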