metadata
library_name: mlx
tags:
  - mlx
  - speech-recognition
  - asr
  - canary
  - apple-silicon
license: cc-by-4.0
language:
  - en
  - bg
  - hr
  - cs
  - da
  - nl
  - et
  - fi
  - fr
  - de
  - el
  - hu
  - it
  - lv
  - lt
  - mt
  - pl
  - pt
  - ro
  - sk
  - sl
  - es
  - sv
  - ru
  - uk

Canary MLX

NVIDIA Canary ASR model converted to MLX format for Apple Silicon.

Usage

Install the package:

pip install canary-mlx

Then load the model and transcribe an audio file:

from canary_mlx import load_model

model = load_model("qfuxa/canary-mlx")
result = model.transcribe("audio.wav", language="en")
print(result)
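For more than one file, the call above can be wrapped in a small helper. The following is only a sketch: `transcribe_files` and `SUPPORTED_LANGUAGES` are hypothetical names that are not part of `canary_mlx`, and the helper assumes nothing about the model beyond the `transcribe(path, language=...)` call shown above. The language codes are copied from this model card's metadata.

```python
from pathlib import Path

# Language codes listed in this model card's metadata (25 in total).
SUPPORTED_LANGUAGES = {
    "en", "bg", "hr", "cs", "da", "nl", "et", "fi", "fr", "de", "el", "hu", "it",
    "lv", "lt", "mt", "pl", "pt", "ro", "sk", "sl", "es", "sv", "ru", "uk",
}


def transcribe_files(model, paths, language="en"):
    """Transcribe several audio files with one loaded model.

    Hypothetical helper: `model` is any object exposing
    `transcribe(path, language=...)`, as in the usage example above.
    Returns a dict mapping each path (as a string) to whatever
    `model.transcribe` returned for it.
    """
    if language not in SUPPORTED_LANGUAGES:
        raise ValueError(f"Unsupported language code: {language!r}")

    results = {}
    for path in paths:
        key = str(Path(path))
        results[key] = model.transcribe(key, language=language)
    return results
```

With the package installed, this could be called as `transcribe_files(load_model("qfuxa/canary-mlx"), ["a.wav", "b.wav"], language="fr")`. Checking the language code before the loop means an unsupported code fails before any audio is processed.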

Model Details

This model is a conversion of NVIDIA's Canary ASR model to Apple's MLX framework.

  • Architecture: Conformer encoder + Transformer decoder
  • Parameters: ~1B
  • Supported Languages: 25 (see the language list in the metadata above)
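As a rough guide to how much memory the weights alone might need, the parameter count can be multiplied by the size of each parameter. This is a back-of-the-envelope sketch: the model card does not state the dtype of the converted weights, so several common choices are shown, and the figure of 1,000,000,000 parameters is only the approximate count given above. Activations and audio buffers are not included.

```python
# Bytes per parameter for some common weight dtypes.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def weight_memory_gib(num_params, dtype):
    """Approximate memory for the weights alone, in GiB (2**30 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


# Approximate parameter count from the model details above (~1B).
APPROX_PARAMS = 1_000_000_000

for dtype in ("float32", "float16", "int8"):
    print(f"{dtype}: ~{weight_memory_gib(APPROX_PARAMS, dtype):.2f} GiB")
```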

Original Model

Based on the NVIDIA NeMo Canary model. See NVIDIA NeMo for the original implementation.

License

The model weights are released under the CC-BY-4.0 license (the same license as the original NVIDIA model).