Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -16,14 +16,14 @@ ONNX export of [unknown](https://huggingface.co/unknown) for fast CPU inference.
|
|
| 16 |
|
| 17 |
- **Source Model**: [unknown](https://huggingface.co/unknown)
|
| 18 |
- **Embedding Dimension**: unknown
|
| 19 |
-
- **Format**: ONNX (FP32)
|
| 20 |
|
| 21 |
## Files
|
| 22 |
|
| 23 |
| File | Description |
|
| 24 |
|------|-------------|
|
| 25 |
| `model.onnx` | FP32 ONNX model |
|
| 26 |
-
|
| 27 |
| `tokenizer.json` | Tokenizer configuration |
|
| 28 |
| `config_sentence_transformers.json` | Model configuration |
|
| 29 |
|
|
|
|
| 16 |
|
| 17 |
- **Source Model**: [unknown](https://huggingface.co/unknown)
|
| 18 |
- **Embedding Dimension**: unknown
|
| 19 |
+
- **Format**: ONNX (FP32 + INT8)
|
| 20 |
|
| 21 |
## Files
|
| 22 |
|
| 23 |
| File | Description |
|
| 24 |
|------|-------------|
|
| 25 |
| `model.onnx` | FP32 ONNX model |
|
| 26 |
+
| `model_int8.onnx` | INT8 quantized model (faster) |
|
| 27 |
| `tokenizer.json` | Tokenizer configuration |
|
| 28 |
| `config_sentence_transformers.json` | Model configuration |
|
| 29 |
|