Update README.md
Browse files
README.md
CHANGED
|
@@ -42,7 +42,7 @@ tags:
|
|
| 42 |
|
| 43 |
### Model Optimizations
|
| 44 |
|
| 45 |
-
This model was obtained by quantizing the weights of [phi-4]https://huggingface.co/microsoft/phi-4) to INT4 data type.
|
| 46 |
This optimization reduces the number of bits per parameter from 16 to 4, reducing the disk size and GPU memory requirements by approximately 75%.
|
| 47 |
|
| 48 |
Only the weights of the linear operators within transformers blocks are quantized.
|
|
|
|
| 42 |
|
| 43 |
### Model Optimizations
|
| 44 |
|
| 45 |
+
This model was obtained by quantizing the weights of [phi-4](https://huggingface.co/microsoft/phi-4) to INT4 data type.
|
| 46 |
This optimization reduces the number of bits per parameter from 16 to 4, reducing the disk size and GPU memory requirements by approximately 75%.
|
| 47 |
|
| 48 |
Only the weights of the linear operators within transformers blocks are quantized.
|