zeroshot/gte-base-quant

Feature Extraction

sparse sparsity quantized onnx embeddings int8


zeroshot committed on Oct 15, 2023

Commit 8d78109 · 1 Parent(s): fbba993

Update README.md

Files changed (1)

README.md +0 -11


@@ -21,17 +21,6 @@ Current list of sparse and quantized gte ONNX models:
 | [zeroshot/gte-small-sparse](https://huggingface.co/zeroshot/gte-small-sparse)     |    Quantization (INT8) & 50% Pruning                    |
 | [zeroshot/gte-small-quant](https://huggingface.co/zeroshot/gte-small-quant)     |   Quantization (INT8)                     |
-BGE models using this architecture:
-| Links                                                                                               | Sparsification Method |
-| --------------------------------------------------------------------------------------------------- | ---------------------- |
-| [zeroshot/bge-large-en-v1.5-sparse](https://huggingface.co/zeroshot/bge-large-en-v1.5-sparse)     |    Quantization (INT8) & 50% Pruning                    |
-| [zeroshot/bge-large-en-v1.5-quant](https://huggingface.co/zeroshot/bge-large-en-v1.5-quant)     |   Quantization (INT8)                     |
-| [zeroshot/bge-base-en-v1.5-sparse](https://huggingface.co/zeroshot/bge-base-en-v1.5-sparse)     |   Quantization (INT8) & 50% Pruning                     |
-| [zeroshot/bge-base-en-v1.5-quant](https://huggingface.co/zeroshot/bge-base-en-v1.5-quant)     |     Quantization (INT8)                    |
-| [zeroshot/bge-small-en-v1.5-sparse](https://huggingface.co/zeroshot/bge-small-en-v1.5-sparse) |    Quantization (INT8) & 50% Pruning                    |
-| [zeroshot/bge-small-en-v1.5-quant](https://huggingface.co/zeroshot/bge-small-en-v1.5-quant) |     Quantization (INT8)                    |
 For general questions on these models and sparsification methods, reach out to the engineering team on our [community Slack](https://join.slack.com/t/discuss-neuralmagic/shared_invite/zt-q1a1cnvo-YBoICSIw3L1dmQpjBeDurQ).
