Automatic Speech Recognition
NeMo
Safetensors
PyTorch
fastconformer
automatic-speech-translation
speech
audio
Transformer
FastConformer
Conformer
NeMo
hf-asr-leaderboard
Eval Results (legacy)
Eval Results
Instructions to use nvidia/canary-1b-flash with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use nvidia/canary-1b-flash with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("nvidia/canary-1b-flash") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
base model encoder choice
#12
by sugintama - opened
For base model's encoder choice, is an encoder with a different FastConformer-like structure compatible with this SALM, such as nvidia/parakeet-tdt-0.6b-v2, or must it strictly be an encoder which is combined with a transformer decoder in base model?
sugintama changed discussion status to closed