---
license: mit
language:
- ru
base_model:
- cointegrated/rubert-tiny2
pipeline_tag: sentence-similarity
tags:
- onnx
- int8
- tiny
- sentence-similarity
- sentence-transformers
---

# RuBERT v2 Tiny (INT8, ONNX)

#### This repository contains an INT8-quantized version of RuBERT v2 Tiny, converted to the ONNX format for efficient CPU inference.

#### Based on the original model: https://huggingface.co/cointegrated/rubert-tiny2

#### Post-training INT8 quantization

#### Optimized for fast and lightweight inference

#### Suitable for embeddings, semantic search, and text classification

*Note: This is a derivative work with format conversion and quantization only.*
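
#### Example: sentence embeddings on CPU (sketch)

The snippet below is a minimal sketch of how the quantized model might be used for embeddings and semantic search with ONNX Runtime. Several details are assumptions rather than facts about this repository: the file name `model.onnx`, loading the tokenizer from the base model `cointegrated/rubert-tiny2`, and the first graph output being the last hidden state with shape `(batch, sequence, hidden)`. Taking the first (`[CLS]`) token and L2-normalizing it is likewise assumed here to match the base model's usual pooling; adjust it if your export differs. The helper names `embed`, `l2_normalize`, and `cosine_similarity` are illustrative only.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors."""
    a, b = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a, b))


def embed(texts, model_path="model.onnx", tokenizer_id="cointegrated/rubert-tiny2"):
    """Return one unit-length embedding (list of floats) per input text.

    `model_path` and `tokenizer_id` are assumptions; point them at the
    actual ONNX file and tokenizer you are using.
    """
    # Heavy dependencies are imported lazily so the helpers above
    # can be used without them.
    import numpy as np
    import onnxruntime as ort
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(tokenizer_id)
    session = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])

    encoded = tokenizer(texts, padding=True, truncation=True, return_tensors="np")

    # Feed only the inputs the exported graph actually declares.
    wanted = {inp.name for inp in session.get_inputs()}
    feeds = {k: v.astype(np.int64) for k, v in encoded.items() if k in wanted}

    # Assumption: the first output is the last hidden state.
    hidden = session.run(None, feeds)[0]
    cls = hidden[:, 0, :]  # first-token ([CLS]) pooling
    return [l2_normalize(row.tolist()) for row in cls]
```

For semantic search, one could embed a query and a list of documents with `embed(...)` and rank the documents by `cosine_similarity(query_vec, doc_vec)`. Because INT8 quantization changes the weights, the resulting vectors may differ slightly from those of the original model.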