---
license: mit
language:
- ru
base_model:
- cointegrated/rubert-tiny2
pipeline_tag: sentence-similarity
tags:
- onnx
- int8
- tiny
- sentence-similarity
- sentence-transformers
---
# RuBERT v2 Tiny (INT8, ONNX)

This repository contains an INT8-quantized version of RuBERT v2 Tiny, converted to the ONNX format for efficient CPU inference.

Based on the original model: https://huggingface.co/cointegrated/rubert-tiny2

- Post-training INT8 quantization
- Optimized for fast and lightweight inference
- Suitable for embeddings, semantic search, and text classification
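
The sketch below shows one way the model might be used for embeddings and semantic search with ONNX Runtime. Several details are assumptions, not facts stated in this repository: the file name `model_quantized.onnx` is hypothetical, the tokenizer is loaded from the original `cointegrated/rubert-tiny2` repository, the first graph output is assumed to be the last hidden state with shape `(batch, tokens, hidden)`, and pooling follows the original model's convention of taking the `[CLS]` vector and L2-normalizing it. The helper names (`load`, `embed`, `cls_pool_and_normalize`, `cosine_similarity_matrix`) are illustrative only.

```python
import numpy as np


def cls_pool_and_normalize(last_hidden_state: np.ndarray) -> np.ndarray:
    """Take the first ([CLS]) token vector of each sequence and L2-normalize it."""
    cls = last_hidden_state[:, 0, :].astype(np.float32)
    norms = np.linalg.norm(cls, axis=1, keepdims=True)
    # Clip to avoid division by zero for an all-zero vector.
    return cls / np.clip(norms, 1e-12, None)


def cosine_similarity_matrix(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Pairwise cosine similarity; inputs are assumed to be L2-normalized."""
    return a @ b.T


def load(model_path: str = "model_quantized.onnx"):
    """Create a CPU session and tokenizer. The default path is hypothetical."""
    import onnxruntime as ort
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("cointegrated/rubert-tiny2")
    session = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
    return session, tokenizer


def embed(texts, session, tokenizer) -> np.ndarray:
    """Encode a list of strings into normalized sentence embeddings."""
    encoded = tokenizer(texts, padding=True, truncation=True, return_tensors="np")
    # Feed only the inputs that the exported graph actually declares.
    input_names = {node.name for node in session.get_inputs()}
    feeds = {k: v.astype(np.int64) for k, v in encoded.items() if k in input_names}
    # Assumption: output 0 is the last hidden state (batch, tokens, hidden).
    outputs = session.run(None, feeds)
    return cls_pool_and_normalize(outputs[0])
```

With a session and tokenizer from `load()`, `cosine_similarity_matrix(embed(queries, session, tokenizer), embed(documents, session, tokenizer))` would give a query-by-document score matrix for semantic search. If the exported graph already returns a pooled sentence vector, the pooling step should be adjusted accordingly.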

*Note: This is a derivative work; the only changes to the original model are format conversion and quantization.*