zeroshot
/

gte-base-sparse

Feature Extraction

sparse sparsity quantized onnx embeddings int8

Model card Files Files and versions

zeroshot commited on Oct 15, 2023

Commit

35ac0f3

·

1 Parent(s): 4eb304a

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -10,7 +10,7 @@ language:
 This is the sparse ONNX variant of the [gte-base](https://huggingface.co/thenlper/gte-base) embeddings model created with [DeepSparse Optimum](https://github.com/neuralmagic/optimum-deepsparse) for ONNX export/inference and Neural Magic's [Sparsify](https://github.com/neuralmagic/sparsify) for one-shot quantization (INT8) and unstructured pruning 50%.
-Current list of sparse and quantized gte-small ONNX models:
 | Links                                                                                               | Sparsification Method |
 | --------------------------------------------------------------------------------------------------- | ---------------------- |

 This is the sparse ONNX variant of the [gte-base](https://huggingface.co/thenlper/gte-base) embeddings model created with [DeepSparse Optimum](https://github.com/neuralmagic/optimum-deepsparse) for ONNX export/inference and Neural Magic's [Sparsify](https://github.com/neuralmagic/sparsify) for one-shot quantization (INT8) and unstructured pruning 50%.
+Current list of sparse and quantized gte ONNX models:
 | Links                                                                                               | Sparsification Method |
 | --------------------------------------------------------------------------------------------------- | ---------------------- |