---
license: mit
language:
- ru
base_model:
- cointegrated/rubert-tiny2
pipeline_tag: sentence-similarity
tags:
- onnx
- int8
- tiny
- sentence-similarity
- sentence-transformers
---
# RuBERT v2 Tiny (INT8, ONNX)
#### This repository contains an INT8-quantized version of RuBERT v2 Tiny, converted to the ONNX format for efficient CPU inference.
#### Based on the original model: https://huggingface.co/cointegrated/rubert-tiny2
#### Post-training INT8 quantization
#### Optimized for fast and lightweight inference
#### Suitable for embeddings, semantic search, and text classification
*Note: This is a derivative work with format conversion and quantization only.*
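
A minimal sketch of how sentence embeddings might be computed from the ONNX model on CPU. It is not taken from this repository and rests on several assumptions: the file names `model.onnx` and `tokenizer.json` are hypothetical placeholders for files downloaded locally, the first graph output is assumed to be `last_hidden_state`, and the graph inputs are assumed to be a subset of `input_ids`, `attention_mask`, and `token_type_ids`. Pooling follows the base model's documented recipe (take the `[CLS]` token vector, then L2-normalize). The packages `onnxruntime`, `tokenizers`, and `numpy` are needed only when `embed` is called.

```python
import math
from typing import List, Sequence

# Hypothetical local file names; adjust to the files actually present.
MODEL_PATH = "model.onnx"
TOKENIZER_PATH = "tokenizer.json"


def l2_normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors of equal length."""
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def embed(
    texts: Sequence[str],
    model_path: str = MODEL_PATH,
    tokenizer_path: str = TOKENIZER_PATH,
) -> List[List[float]]:
    """Return one L2-normalized [CLS] embedding per input text."""
    # Heavy dependencies are imported here so the helpers above stay stdlib-only.
    import numpy as np
    import onnxruntime as ort
    from tokenizers import Tokenizer

    tokenizer = Tokenizer.from_file(tokenizer_path)
    tokenizer.enable_padding()                   # pad each batch to its longest sequence
    tokenizer.enable_truncation(max_length=512)  # conservative limit, an assumption
    encodings = tokenizer.encode_batch(list(texts))

    session = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
    available = {
        "input_ids": np.array([e.ids for e in encodings], dtype=np.int64),
        "attention_mask": np.array([e.attention_mask for e in encodings], dtype=np.int64),
        "token_type_ids": np.array([e.type_ids for e in encodings], dtype=np.int64),
    }
    # Feed only the inputs this graph declares (raises KeyError on an unexpected name).
    feeds = {inp.name: available[inp.name] for inp in session.get_inputs()}

    last_hidden_state = session.run(None, feeds)[0]  # assumed shape: (batch, seq, hidden)
    cls_vectors = last_hidden_state[:, 0, :]         # [CLS] token of each sequence
    return [l2_normalize(row.tolist()) for row in cls_vectors]
```

With the model and tokenizer files in place, `a, b = embed(["привет мир", "hello world"])` followed by `cosine_similarity(a, b)` would give a similarity score for the two sentences. Because the weights are INT8-quantized, scores may differ slightly from those of the original model.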