electroglyph committed · Commit a860ce3 · verified · 1 Parent(s): 13fb8fd

Upload folder using huggingface_hub

Files changed (2)
  1. README.md +2 -0
  2. onnx/model.onnx +2 -2
README.md CHANGED
@@ -10,6 +10,8 @@ tags:
 
 # embeddinggemma-300m-ONNX-uint8
 
+Update Sep. 20, 2025: I removed the last_hidden_state output from the model and left only the sentence_embedding one.
+
 This is based on https://huggingface.co/onnx-community/embeddinggemma-300m-ONNX/blob/main/onnx/model_quantized.onnx, but it outputs a uint8 tensor instead of an f32 one.
 
 This model is compatible with Qdrant, but I'm not sure what other vector DBs it's compatible with.
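
The README change above says the model now exposes a single `sentence_embedding` output with uint8 elements. A minimal sketch of how one might confirm that after downloading the file, assuming ONNX Runtime is installed and reports the element type as the string `tensor(uint8)`. The helper names and the local path are hypothetical and not part of this commit.

```python
from typing import Any, List, Tuple


def open_session(model_path: str) -> Any:
    """Open an ONNX Runtime session (third-party package, assumed installed)."""
    import onnxruntime as ort  # imported lazily so the helpers below work without it

    return ort.InferenceSession(model_path)


def describe_outputs(session: Any) -> List[Tuple[str, str]]:
    """Return (name, element type) for every output the session declares."""
    return [(out.name, out.type) for out in session.get_outputs()]


def has_only_uint8_sentence_embedding(session: Any) -> bool:
    """True if the only declared output is a uint8 `sentence_embedding` tensor."""
    return describe_outputs(session) == [("sentence_embedding", "tensor(uint8)")]
```

With a local copy of the model, something like `has_only_uint8_sentence_embedding(open_session("onnx/model.onnx"))` would be expected to return `True` for this revision and `False` for the parent revision, which still declared `last_hidden_state`.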
onnx/model.onnx CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:21585443cf1ee0e87ba306ba9b1b97761d0aa3666f96947f8e65123dfee06688
-size 309435349
+oid sha256:cd28a6bf4d485ae180857da232c188fedb53b00fc31452f019720d23c003d2eb
+size 309435276
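
The `onnx/model.onnx` entry in the repository is a Git LFS pointer, so the diff only shows the new SHA-256 digest and byte size. A minimal sketch of checking a downloaded file against such a pointer, using only the Python standard library. The function names are hypothetical, and the sketch assumes the pointer uses the `sha256` algorithm as shown in this diff.

```python
import hashlib
from typing import BinaryIO, Dict


def parse_lfs_pointer(text: str) -> Dict[str, str]:
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields: Dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def stream_matches_pointer(stream: BinaryIO, pointer_text: str) -> bool:
    """Compare a binary stream's size and SHA-256 digest with the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected_digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")

    digest = hashlib.sha256()
    total = 0
    # Read in 1 MiB chunks so a ~300 MB model is never held in memory at once.
    for chunk in iter(lambda: stream.read(1 << 20), b""):
        digest.update(chunk)
        total += len(chunk)

    return total == int(fields["size"]) and digest.hexdigest() == expected_digest
```

To use it, open the downloaded model with `open(path, "rb")` and pass the three pointer lines from the `+` side of the diff as `pointer_text`.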