zeroshot/gte-small-quant

zeroshot committed on Oct 12, 2023

Commit 155e73f · 1 Parent(s): ada8379

Update README.md

Files changed (1)

README.md +1 -1

@@ -15,7 +15,7 @@ Current list of sparse and quantized gte-small ONNX models:
 | Links                                                                                               | Sparsification Method |
 | --------------------------------------------------------------------------------------------------- | ---------------------- |
 | [zeroshot/bge-large-en-v1.5-sparse](https://huggingface.co/zeroshot/gte-small-sparse)     |    Quantization (INT8) & 50% Pruning                    |
-| [zeroshot/bge-large-en-v1.5-quant](https://huggingface.co/zeroshot/gte-small quant)     |   Quantization (INT8)                     |
+| [zeroshot/bge-large-en-v1.5-quant](https://huggingface.co/zeroshot/gte-small-quant)     |   Quantization (INT8)                     |
 BGE models using this architecture: