Update README.md
Browse files
README.md
CHANGED
|
@@ -4,4 +4,16 @@ base_model:
|
|
| 4 |
- intfloat/multilingual-e5-small
|
| 5 |
---
|
| 6 |
|
| 7 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
- intfloat/multilingual-e5-small
|
| 5 |
---
|
| 6 |
|
| 7 |
+
the original model is OriginalModel.mlpackage with float32
|
| 8 |
+
|
| 9 |
+
# the quantization model info :
|
| 10 |
+
|
| 11 |
+
## quantization and input maxlength
|
| 12 |
+
|
| 13 |
+
coreML: using linear quantize nbit=8
|
| 14 |
+
|
| 15 |
+
input max = 128
|
| 16 |
+
|
| 17 |
+
## note
|
| 18 |
+
|
| 19 |
+
i tried turn it into float`6, but it changed too much for its prediction. quantization using linear nbit=8, it works almost like the original.
|