---
library_name: mlx
tags:
- mlx
- speech-recognition
- asr
- canary
- apple-silicon
license: cc-by-4.0
language:
- en
- bg
- hr
- cs
- da
- nl
- et
- fi
- fr
- de
- el
- hu
- it
- lv
- lt
- mt
- pl
- pt
- ro
- sk
- sl
- es
- sv
- ru
- uk
---
# Canary MLX
NVIDIA Canary ASR model converted to MLX format for Apple Silicon.
## Usage
```bash
pip install canary-mlx
```
```python
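from canary_mlx import load_model

# Load the converted model by its repository ID.
model = load_model("qfuxa/canary-mlx")

# Transcribe a local audio file; `language` takes one of the codes listed in the metadata.
result = model.transcribe("audio.wav", language="en")
print(result)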
```
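To transcribe several files, the call above can be wrapped in a small loop. The following is a minimal sketch: `transcribe_many` is a hypothetical helper, not part of `canary-mlx`. It assumes only the `transcribe(path, language=...)` call shown above and treats the return value as opaque.

```python
from typing import Any, Dict, Iterable


def transcribe_many(model: Any, paths: Iterable[str], language: str = "en") -> Dict[str, Any]:
    """Transcribe each file with the same model and language.

    Hypothetical helper: it relies only on model.transcribe(path, language=...).
    """
    results: Dict[str, Any] = {}
    for path in paths:
        # Store whatever transcribe() returns, unchanged, keyed by the file path.
        results[str(path)] = model.transcribe(str(path), language=language)
    return results


# Usage, assuming `model` was loaded as in the example above:
# results = transcribe_many(model, ["first.wav", "second.wav"], language="fr")
```

Loading the model once and reusing it for every file avoids repeating the load step for each transcription.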
## Model Details
This model is a conversion of NVIDIA's Canary ASR model to Apple's MLX framework.
- **Architecture**: Conformer encoder + Transformer decoder
- **Parameters**: ~1B
- **Supported Languages**: 25 languages (see the `language` field in the model card metadata)
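
The language list can also be checked in code before calling the model. This is a sketch only: `SUPPORTED_LANGUAGES` and `check_language` are hypothetical names, not part of `canary-mlx`, and the codes are copied from this model card's metadata rather than read from the library.

```python
# Language codes copied from this model card's `language` metadata (25 entries).
SUPPORTED_LANGUAGES = frozenset({
    "en", "bg", "hr", "cs", "da", "nl", "et", "fi", "fr", "de",
    "el", "hu", "it", "lv", "lt", "mt", "pl", "pt", "ro", "sk",
    "sl", "es", "sv", "ru", "uk",
})


def check_language(code: str) -> str:
    """Return the normalized language code, or raise ValueError if it is not listed."""
    normalized = code.strip().lower()
    if normalized not in SUPPORTED_LANGUAGES:
        raise ValueError(
            f"Unsupported language {code!r}; expected one of {sorted(SUPPORTED_LANGUAGES)}"
        )
    return normalized
```

Checking the code up front reports an unsupported language with a clear message, instead of relying on however the model itself handles it.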
## Original Model
Based on the NVIDIA NeMo Canary model. See [NVIDIA NeMo](https://github.com/NVIDIA/NeMo) for the original implementation.
## License
Model weights are released under the CC-BY-4.0 license (the same license as the original NVIDIA model).