Canary MLX

NVIDIA Canary ASR model converted to MLX format for Apple Silicon.

Usage

pip install canary-mlx
from canary_mlx import load_model

model = load_model("qfuxa/canary-mlx")
result = model.transcribe("audio.wav", language="en")
print(result)

Model Details

This model is a conversion of NVIDIA's Canary ASR model to Apple's MLX framework.

  • Architecture: Conformer encoder + Transformer decoder
  • Parameters: ~1B
  • Supported Languages: 25 languages (see tags)

Original Model

Based on NVIDIA NeMo Canary model. See NVIDIA NeMo for the original implementation.

License

Model weights are released under CC-BY-4.0 license (same as original NVIDIA model).

Downloads last month
23
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support