metadata
library_name: mlx
tags:
- mlx
- speech-recognition
- asr
- canary
- apple-silicon
license: cc-by-4.0
language:
- en
- bg
- hr
- cs
- da
- nl
- et
- fi
- fr
- de
- el
- hu
- it
- lv
- lt
- mt
- pl
- pt
- ro
- sk
- sl
- es
- sv
- ru
- uk
Canary MLX
NVIDIA Canary ASR model converted to MLX format for Apple Silicon.
Usage
pip install canary-mlx
from canary_mlx import load_model
model = load_model("qfuxa/canary-mlx")
result = model.transcribe("audio.wav", language="en")
print(result)
Model Details
This model is a conversion of NVIDIA's Canary ASR model to Apple's MLX framework.
- Architecture: Conformer encoder + Transformer decoder
- Parameters: ~1B
- Supported Languages: 25 languages (see tags)
Original Model
Based on NVIDIA NeMo Canary model. See NVIDIA NeMo for the original implementation.
License
Model weights are released under CC-BY-4.0 license (same as original NVIDIA model).