electroglyph
/

embeddinggemma-300m-ONNX-uint8

Sentence Similarity

Transformers.js

feature-extraction

text-embeddings-inference

Model card Files Files and versions

electroglyph commited on Sep 20, 2025

Commit

a860ce3

·

verified ·

1 Parent(s): 13fb8fd

Upload folder using huggingface_hub

Files changed (2) hide show

README.md +2 -0
onnx/model.onnx +2 -2

README.md CHANGED Viewed

@@ -10,6 +10,8 @@ tags:
 # embeddinggemma-300m-ONNX-uint8
 This is based on https://huggingface.co/onnx-community/embeddinggemma-300m-ONNX/blob/main/onnx/model_quantized.onnx, but it outputs a uint8 tensor instead of an f32 one.
 This model is compatible with Qdrant, but I'm not sure what other vector DBs it's compatible with.

 # embeddinggemma-300m-ONNX-uint8
+Update Sep. 20, 2025: I removed the last_hidden_state output from the model and left only the sentence_embedding one.
 This is based on https://huggingface.co/onnx-community/embeddinggemma-300m-ONNX/blob/main/onnx/model_quantized.onnx, but it outputs a uint8 tensor instead of an f32 one.
 This model is compatible with Qdrant, but I'm not sure what other vector DBs it's compatible with.

onnx/model.onnx CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:21585443cf1ee0e87ba306ba9b1b97761d0aa3666f96947f8e65123dfee06688
-size 309435349

 version https://git-lfs.github.com/spec/v1
+oid sha256:cd28a6bf4d485ae180857da232c188fedb53b00fc31452f019720d23c003d2eb
+size 309435276