rubert-tiny2-int8 / README.md
trend
UPD
df501f9 verified
metadata
license: mit
language:
  - ru
base_model:
  - cointegrated/rubert-tiny2
pipeline_tag: sentence-similarity
tags:
  - onnx
  - int8
  - tiny
  - sentence-similarity
  - sentence-transformers

RuBERT v2 Tiny (INT8, ONNX)

This repository contains an INT8-quantized version of RuBERT v2 Tiny, converted to the ONNX format for efficient CPU inference.

Based on the original model: https://huggingface.co/cointegrated/rubert-tiny2

Post-training INT8 quantization

Optimized for fast and lightweight inference

Suitable for embeddings, semantic search, and text classification

Note: This is a derivative work with format conversion and quantization only.