--- license: mit library_name: onnx tags: - automatic-speech-recognition - speech - russian - onnx - rnnt - gigaam language: - ru base_model: - ai-sage/GigaAM-v3 --- # CoppersideASR CoppersideASR is an ONNX RNN-T speech recognition model bundle for the CopperASR runtime. The model is configured for Russian speech recognition and 16 kHz audio input. The repository uses a minimal normalized layout: three ONNX graphs, a token mapping, and a runtime manifest. CopperASR can load this bundle directly without additional conversion steps. ## Files - `encoder.onnx` - `decoder.onnx` - `joint.onnx` - `tokens.txt` - `model_manifest.json` ## Runtime Contract | File | Purpose | | --- | --- | | `encoder.onnx` | RNN-T acoustic encoder | | `decoder.onnx` | RNN-T prediction network | | `joint.onnx` | RNN-T joint network | | `tokens.txt` | Token id mapping with `` at id `1024` | | `model_manifest.json` | Runtime contract and checksums | ## Usage with CopperASR ```python from copper_asr import CopperASR asr = CopperASR(model_source="copperside-gigaam-v3-e2e-rnnt") result = asr.transcribe("audio.wav") print(result.full_text) ``` For a local checkout: ```python asr = CopperASR( model_source="copperside-gigaam-v3-e2e-rnnt", model_path="path/to/CoppersideASR", ) ``` ## Model Details The model uses an RNN-T layout with separate encoder, decoder, and joint ONNX graphs. Tokenizer settings, feature extraction parameters, decoder limits, and file checksums are recorded in `model_manifest.json`.