onnx and openvino - huge files to download

#32

by venkey - opened Oct 25, 2024

Oct 25, 2024

few days ago, Bunch of files onnx and openvino files are included and they are getting downloaded as part of model, as the file sizes are huge, its taking more time, is there anyway we can control this behavior, i.e to download only the required files or specific file type.

tomaarsen

Sentence Transformers org Oct 25, 2024

Hello!

Indeed, the ONNX/OV files accompany the recent v3.2.0 release, which brings faster inference via various means: https://sbert.net/docs/sentence_transformer/usage/efficiency.html

Only old Sentence Transformers versions (2.2.2 or older I believe) download all files from the repository - all newer versions only download the required files.
My strong recommendation is to use a newer Sentence Transformers version. Note that all newer versions are designed to be backwards compatible with loading and computing embeddings.

Tom Aarsen

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment