trend commited on
Commit
df501f9
·
verified ·
1 Parent(s): 841170a
Files changed (1) hide show
  1. README.md +14 -1
README.md CHANGED
@@ -11,4 +11,17 @@ tags:
11
  - tiny
12
  - sentence-similarity
13
  - sentence-transformers
14
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  - tiny
12
  - sentence-similarity
13
  - sentence-transformers
14
+ ---
15
+ # RuBERT v2 Tiny (INT8, ONNX)
16
+
17
+ #### This repository contains an INT8-quantized version of RuBERT v2 Tiny, converted to the ONNX format for efficient CPU inference.
18
+
19
+ #### Based on the original model: https://huggingface.co/cointegrated/rubert-tiny2
20
+
21
+ #### Post-training INT8 quantization
22
+
23
+ #### Optimized for fast and lightweight inference
24
+
25
+ #### Suitable for embeddings, semantic search, and text classification
26
+
27
+ *Note: This is a derivative work with format conversion and quantization only.*