Instructions to use OpenVoiceOS/parakeet-ctc-0.6b-coreml with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use OpenVoiceOS/parakeet-ctc-0.6b-coreml with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("OpenVoiceOS/parakeet-ctc-0.6b-coreml") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
File size: 1,057 Bytes
5ee7698 eca44ea 5ee7698 eca44ea 5ee7698 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 | {
"model_id": "nvidia/parakeet-ctc-0.6b",
"model_type": "ctc",
"language": "",
"sample_rate": 16000,
"max_audio_seconds": 15.0,
"max_audio_samples": 240000,
"vocab_size": 1024,
"blank_id": 1024,
"checkpoint": {
"type": "pretrained",
"model_id": "nvidia/parakeet-ctc-0.6b"
},
"coreml": {
"compute_precision": "FLOAT32",
"quantization": "none"
},
"components": {
"mel_encoder": {
"path": "parakeet_mel_encoder.mlpackage",
"inputs": {
"audio_signal": [
1,
240000
],
"audio_length": [
1
]
},
"outputs": {
"encoder": [
1,
1024,
188
],
"encoder_length": [
1
]
}
},
"ctc_decoder": {
"path": "parakeet_ctc_decoder.mlpackage",
"inputs": {
"encoder": [
1,
1024,
188
]
},
"outputs": {
"log_probs": [
1,
188,
1025
]
}
}
}
} |