File size: 1,534 Bytes
efe8dd4 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 | ---
license: mit
library_name: onnx
tags:
- automatic-speech-recognition
- speech
- russian
- onnx
- rnnt
- gigaam
language:
- ru
base_model:
- ai-sage/GigaAM-v3
---
# CoppersideASR
CoppersideASR is an ONNX RNN-T speech recognition model bundle for the
CopperASR runtime. The model is configured for Russian speech recognition and
16 kHz audio input.
The repository uses a minimal normalized layout: three ONNX graphs, a token
mapping, and a runtime manifest. CopperASR can load this bundle directly without
additional conversion steps.
## Files
- `encoder.onnx`
- `decoder.onnx`
- `joint.onnx`
- `tokens.txt`
- `model_manifest.json`
## Runtime Contract
| File | Purpose |
| --- | --- |
| `encoder.onnx` | RNN-T acoustic encoder |
| `decoder.onnx` | RNN-T prediction network |
| `joint.onnx` | RNN-T joint network |
| `tokens.txt` | Token id mapping with `<blk>` at id `1024` |
| `model_manifest.json` | Runtime contract and checksums |
## Usage with CopperASR
```python
from copper_asr import CopperASR
asr = CopperASR(model_source="copperside-gigaam-v3-e2e-rnnt")
result = asr.transcribe("audio.wav")
print(result.full_text)
```
For a local checkout:
```python
asr = CopperASR(
model_source="copperside-gigaam-v3-e2e-rnnt",
model_path="path/to/CoppersideASR",
)
```
## Model Details
The model uses an RNN-T layout with separate encoder, decoder, and joint ONNX
graphs. Tokenizer settings, feature extraction parameters, decoder limits, and
file checksums are recorded in `model_manifest.json`.
|