sentence-transformers
/

all-MiniLM-L6-v2

Sentence Similarity

sentence-transformers

feature-extraction

text-embeddings-inference

Model card Files Files and versions

Quantization training technique used for all miniLM L6 v2 quantized model.

#100

by learnerX - opened Feb 4, 2025

Which quantization training technique used for all miniLM L6 v2 quantized model ??

Sentence Transformers org Feb 12, 2025

Hello!

It depends, we have (u)int8 quantized models using arm64, avx2, avx512, avx512_vnni quantization as defined here: https://huggingface.co/docs/optimum/main/en/onnxruntime/package_reference/configuration#optimum.onnxruntime.AutoQuantizationConfig
And with OpenVINO, we use Static Quantization as described here: https://huggingface.co/docs/optimum/main/en/intel/openvino/optimization#static-quantization

Tom Aarsen

Will quantization aware training technique can preserve the accuracy of all mini lm l6v2 model if we want to apply this technique ??

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment