	---
	library_name: mlx
	tags:
	- mlx
	- speech-recognition
	- asr
	- canary
	- apple-silicon
	license: cc-by-4.0
	language:
	- en
	- bg
	- hr
	- cs
	- da
	- nl
	- et
	- fi
	- fr
	- de
	- el
	- hu
	- it
	- lv
	- lt
	- mt
	- pl
	- pt
	- ro
	- sk
	- sl
	- es
	- sv
	- ru
	- uk
	---

	# Canary MLX

	NVIDIA Canary ASR model converted to MLX format for Apple Silicon.

	## Usage

	```bash
	pip install canary-mlx
	```

	```python
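	from canary_mlx import load_model

	# Load the converted model by its Hugging Face repo ID.
	model = load_model("qfuxa/canary-mlx")

	# Transcribe an audio file, passing the spoken language as a language code.
	result = model.transcribe("audio.wav", language="en")
	print(result)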
	```
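
To transcribe several files, the call above can be wrapped in a loop. The following is a minimal sketch, assuming `model.transcribe(path, language=...)` behaves as in the example above. `transcribe_many` is a hypothetical helper, not part of `canary_mlx`, and because the return type of `transcribe` is not documented here, results are stored unchanged.

```python
from pathlib import Path
from typing import Any, Dict, Iterable, Union


def transcribe_many(
    model: Any,
    paths: Iterable[Union[str, Path]],
    language: str = "en",
) -> Dict[str, Any]:
    """Call model.transcribe on each path and collect the results by path.

    Hypothetical helper: it only assumes the model object exposes
    transcribe(path, language=...) as shown in the usage example.
    """
    results: Dict[str, Any] = {}
    for path in paths:
        # Pass the path as a string, matching the single-file example.
        results[str(path)] = model.transcribe(str(path), language=language)
    return results
```

With a loaded model, `transcribe_many(model, ["a.wav", "b.wav"], language="en")` would return a dictionary mapping each path to whatever `transcribe` returns for it.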

	## Model Details

	This model is a conversion of NVIDIA's Canary ASR model to Apple's MLX framework.

	- Architecture: Conformer encoder + Transformer decoder
	- Parameters: ~1B
	- Supported Languages: 25 languages (see the `language` list in the metadata above)
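
Since the model covers a fixed set of languages, it can help to check a language code before passing it to `transcribe`. The sketch below copies the 25 codes from this card's `language` metadata. `SUPPORTED_LANGUAGES` and `check_language` are hypothetical names for illustration; whether `canary_mlx` performs its own validation is not stated here.

```python
# Language codes copied from the `language` list in this model card's metadata.
SUPPORTED_LANGUAGES = frozenset({
    "en", "bg", "hr", "cs", "da", "nl", "et", "fi", "fr", "de",
    "el", "hu", "it", "lv", "lt", "mt", "pl", "pt", "ro", "sk",
    "sl", "es", "sv", "ru", "uk",
})


def check_language(code: str) -> str:
    """Return the normalized code, or raise ValueError if it is not listed.

    Hypothetical helper, not part of canary_mlx.
    """
    normalized = code.strip().lower()
    if normalized not in SUPPORTED_LANGUAGES:
        raise ValueError(
            f"Unsupported language code {code!r}; "
            f"expected one of {sorted(SUPPORTED_LANGUAGES)}"
        )
    return normalized
```

A call such as `model.transcribe("audio.wav", language=check_language("de"))` then fails early with a readable error if the code is not in the list.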

	## Original Model

	Based on the NVIDIA NeMo Canary model. See [NVIDIA NeMo](https://github.com/NVIDIA/NeMo) for the original implementation.

	## License

	Model weights are released under the CC-BY-4.0 license (the same license as the original NVIDIA model).