OpenVoiceOS
/

parakeet-rnnt-0.6b-coreml

Automatic Speech Recognition

Model card Files Files and versions

parakeet-rnnt-0.6b-coreml / README.md

Jarbas's picture

Upload folder using huggingface_hub

83c67ca verified about 1 month ago

|

history blame contribute delete

1.27 kB

	---
	license: cc-by-4.0
	language:
	- en
	tags:
	- asr
	- speech
	- coreml
	- nemo
	- parakeet
	- nvidia
	library_name: coremltools
	pipeline_tag: automatic-speech-recognition
	base_model: nvidia/parakeet-rnnt-0.6b
	---

	# parakeet-rnnt-0.6b-coreml

	CoreML conversion of [nvidia/parakeet-rnnt-0.6b](https://huggingface.co/nvidia/parakeet-rnnt-0.6b).

	\| \| \|
	\|---\|---\|
	\| Architecture \| RNNT \|
	\| Language \| English \|
	\| Sample rate \| 16000 Hz \|
	\| Max audio \| 15.0s \|
	\| Vocab size \| 1024 \|
	\| Framework \| NVIDIA NeMo → CoreML (coremltools) \|

	## Components

	\| File \| Component \| Best compute \|
	\|------\|-----------\|--------------\|
	\| `parakeet_mel_encoder.mlpackage` \| mel_encoder \| ANE / GPU \|
	\| `parakeet_decoder.mlpackage` \| decoder \| CPU only \|
	\| `parakeet_joint_decision_single_step.mlpackage` \| joint_decision_single_step \| ANE / GPU \|

	## Usage

	```bash
	pip install ovos-stt-plugin-coreml
	```

	```python
	from ovos_stt_plugin_coreml import CoremlSTT
	from ovos_plugin_manager.utils.audio import AudioFile

	stt = CoremlSTT(config={"metadata": "metadata.json"})

	with AudioFile("speech.wav") as f:
	audio = f.read()
	print(stt.execute(audio))
	```

	## Source model

	[nvidia/parakeet-rnnt-0.6b](https://huggingface.co/nvidia/parakeet-rnnt-0.6b)