electroglyph/snowflake2_m_uint8

Sentence Similarity

sentence-transformers

Transformers.js

feature-extraction


electroglyph committed on May 6, 2025

Commit a9259f7 · verified · 1 Parent(s): a17503a

Update README.md

Files changed (1)

README.md +0 -2


@@ -95,8 +95,6 @@ I have added a linear quantization node before the `sentence_embedding` output s
 This is compatible with the [qdrant](https://github.com/qdrant/qdrant) uint8 datatype for collections.
-No benchmarks, but it in my limited testing it's exactly equivalent to the FP32 output of the uint8 quantized ONNX model.
 # Quantization method
 Linear quantization for the scale -.3 to 0.3, which is what sentence_embedding is normalized to.
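
For readers unfamiliar with the technique, here is a minimal sketch of what linear quantization of the range -0.3 to 0.3 into uint8 could look like. The exact arithmetic of the ONNX node is not shown in this diff, so the mapping below (clip to the range, map the endpoints to 0 and 255, round to nearest) is an assumption, and the names `quantize_uint8` and `dequantize_uint8` are hypothetical.

```python
# Assumed range, taken from the README text: embeddings lie in [-0.3, 0.3].
LOW, HIGH = -0.3, 0.3


def quantize_uint8(values, low=LOW, high=HIGH):
    """Map floats in [low, high] to integers in 0..255 (assumed mapping)."""
    scale = (high - low) / 255  # width of one quantization step
    quantized = []
    for v in values:
        clipped = min(max(v, low), high)  # out-of-range values saturate
        quantized.append(int(round((clipped - low) / scale)))
    return quantized


def dequantize_uint8(codes, low=LOW, high=HIGH):
    """Approximate inverse: map 0..255 codes back to floats in [low, high]."""
    scale = (high - low) / 255
    return [q * scale + low for q in codes]
```

Under this assumed mapping, the round trip loses at most half a quantization step per component, roughly 0.0012 for this range.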
