Copperside
/

CoppersideASR

Automatic Speech Recognition

Model card Files Files and versions

CoppersideASR / README.md

Copperside's picture

Add CoppersideASR ONNX model bundle

efe8dd4 verified 13 days ago

|

history blame contribute delete

1.53 kB

	---
	license: mit
	library_name: onnx
	tags:
	- automatic-speech-recognition
	- speech
	- russian
	- onnx
	- rnnt
	- gigaam
	language:
	- ru
	base_model:
	- ai-sage/GigaAM-v3
	---

	# CoppersideASR

	CoppersideASR is an ONNX RNN-T speech recognition model bundle for the
	CopperASR runtime. The model is configured for Russian speech recognition and
	16 kHz audio input.

	The repository uses a minimal normalized layout: three ONNX graphs, a token
	mapping, and a runtime manifest. CopperASR can load this bundle directly without
	additional conversion steps.

	## Files

	- `encoder.onnx`
	- `decoder.onnx`
	- `joint.onnx`
	- `tokens.txt`
	- `model_manifest.json`

	## Runtime Contract

	\| File \| Purpose \|
	\| --- \| --- \|
	\| `encoder.onnx` \| RNN-T acoustic encoder \|
	\| `decoder.onnx` \| RNN-T prediction network \|
	\| `joint.onnx` \| RNN-T joint network \|
	\| `tokens.txt` \| Token id mapping with `<blk>` at id `1024` \|
	\| `model_manifest.json` \| Runtime contract and checksums \|

	## Usage with CopperASR

	```python
	from copper_asr import CopperASR

	asr = CopperASR(model_source="copperside-gigaam-v3-e2e-rnnt")
	result = asr.transcribe("audio.wav")
	print(result.full_text)
	```

	For a local checkout:

	```python
	asr = CopperASR(
	model_source="copperside-gigaam-v3-e2e-rnnt",
	model_path="path/to/CoppersideASR",
	)
	```

	## Model Details

	The model uses an RNN-T layout with separate encoder, decoder, and joint ONNX
	graphs. Tokenizer settings, feature extraction parameters, decoder limits, and
	file checksums are recorded in `model_manifest.json`.