Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
cstr
/
Octen-Embedding-0.6B-ONNX-INT8
like
1
Sentence Similarity
ONNX
4 languages
onnxruntime
qwen3
embedding
text-embedding
retrieval
feature-extraction
quantized
int8
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
Octen-Embedding-0.6B-ONNX-INT8
1.08 GB
Ctrl+K
Ctrl+K
1 contributor
History:
6 commits
cstr
fix: clean re-quantization (per_channel=False, no stale data appending), 1.0GB
72815b3
verified
19 days ago
.gitattributes
Safe
1.63 kB
Upload folder using huggingface_hub
20 days ago
README.md
Safe
3.8 kB
Upload README.md with huggingface_hub
20 days ago
added_tokens.json
Safe
707 Bytes
Upload folder using huggingface_hub
20 days ago
config.json
Safe
1.35 kB
Upload folder using huggingface_hub
20 days ago
merges.txt
Safe
1.67 MB
Upload folder using huggingface_hub
20 days ago
model.int8.onnx
5.41 MB
xet
Upload model.int8.onnx with huggingface_hub
20 days ago
model.int8.onnx.data
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1.06 GB
xet
fix: clean re-quantization (per_channel=False, no stale data appending), 1.0GB
19 days ago
quantize_octen_int8.py
Safe
5.67 kB
Upload folder using huggingface_hub
20 days ago
special_tokens_map.json
Safe
613 Bytes
Upload folder using huggingface_hub
20 days ago
tokenizer.json
Safe
11.4 MB
xet
Upload folder using huggingface_hub
20 days ago
tokenizer_config.json
Safe
5.4 kB
Upload folder using huggingface_hub
20 days ago
vocab.json
Safe
2.78 MB
Upload folder using huggingface_hub
20 days ago