Update README.md
README.md
CHANGED
@@ -27,7 +27,7 @@ widget:
 It uses dynamic quantization for lighter deployment and faster inference.
 
 Original model: **float16**, ~6.4GB
-Quantized model: **int8 dynamic**, ~6.4GB
+Quantized model: **int8 dynamic**, ~6.4GB, ~20% faster inference
 
 ## ⚡️ Quickstart
 