Add exported openvino model 'openvino_model_qint8_quantized.xml'

#16

by thomasht86 - opened May 15, 2025

base: refs/heads/main

←

from: refs/pr/16

Discussion Files changed

May 15, 2025

sentence-transformers/backend-export

Hello!

This pull request has been automatically generated from the export_static_quantized_openvino_model function from the Sentence Transformers library.

Config

OVQuantizationConfig(
    quant_method=<OVQuantizationMethod.DEFAULT: 'default'>
)

Tip:

Consider testing this pull request before merging by loading the model from this PR with the revision argument:

from sentence_transformers import SentenceTransformer

# TODO: Fill in the PR number
pr_number = 2
model = SentenceTransformer(
    "Alibaba-NLP/gte-modernbert-base",
    revision=f"refs/pr/{pr_number}",
    backend="openvino",
    model_kwargs={"file_name": "openvino_model_qint8_quantized.xml"},
)

# Verify that everything works as expected
embeddings = model.encode(["The weather is lovely today.", "It's so sunny outside!", "He drove to the stadium."])
print(embeddings.shape)

similarities = model.similarity(embeddings, embeddings)
print(similarities)

Add exported openvino model 'openvino_model_qint8_quantized.xml'87271960

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment