Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -42,7 +42,7 @@ tags:
 ### Model Optimizations
-This model was obtained by quantizing the weights of [phi-4]https://huggingface.co/microsoft/phi-4) to INT4 data type.
 This optimization reduces the number of bits per parameter from 16 to 4, reducing the disk size and GPU memory requirements by approximately 75%.
 Only the weights of the linear operators within transformers blocks are quantized.

 ### Model Optimizations
+This model was obtained by quantizing the weights of [phi-4](https://huggingface.co/microsoft/phi-4) to INT4 data type.
 This optimization reduces the number of bits per parameter from 16 to 4, reducing the disk size and GPU memory requirements by approximately 75%.
 Only the weights of the linear operators within transformers blocks are quantized.