parakeet-rnnt-0.6b GGUF
GGUF conversions of nvidia/parakeet-rnnt-0.6b for use with CrispASR.
Files
| File | Size | Quantization |
|---|---|---|
| parakeet-rnnt-0.6b-q4_k.gguf | ~447 MB | Q4_K (recommended) |
| parakeet-rnnt-0.6b-f16.gguf | ~1.2 GB | F16 (full precision) |
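As a rough sanity check on the sizes listed above, the average storage cost per weight can be estimated from the file size. This is only a sketch: the parameter count of about 0.6e9 is an assumption read off the "0.6b" in the model name, the sizes are the approximate values from the table, and decimal units are assumed.

```python
# Rough estimate of bits stored per weight for each file.
# Assumptions (not stated in this README): ~0.6e9 parameters, taken from the
# "0.6b" in the model name, and decimal units (1 MB = 1e6 bytes).
N_PARAMS = 0.6e9


def bits_per_weight(file_size_bytes: float, n_params: float = N_PARAMS) -> float:
    """Average bits per parameter, ignoring GGUF metadata overhead."""
    return file_size_bytes * 8 / n_params


# Approximate sizes copied from the table.
sizes = {
    "parakeet-rnnt-0.6b-q4_k.gguf": 447e6,  # ~447 MB
    "parakeet-rnnt-0.6b-f16.gguf": 1.2e9,   # ~1.2 GB
}

for name, size in sizes.items():
    print(f"{name}: ~{bits_per_weight(size):.1f} bits/weight")
```

Under these assumptions the F16 file works out to about 16 bits per weight, as expected, and the Q4_K file to roughly 6 bits per weight on average.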
Usage
crispasr --backend parakeet \
-m parakeet-rnnt-0.6b-q4_k.gguf \
-f audio.wav
Or let CrispASR auto-download:
crispasr --backend parakeet-rnnt-0.6b -f audio.wav
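The two invocations above can also be driven from a script. The sketch below is a hypothetical Python wrapper, not part of CrispASR: it uses only the flags shown in this README (--backend, -m, -f), and it assumes the transcript is written to standard output and that a non-zero exit code means failure.

```python
import subprocess
from typing import List, Optional


def build_crispasr_command(audio_path: str,
                           model_path: Optional[str] = None,
                           backend: str = "parakeet") -> List[str]:
    """Build the argument list for the invocations shown in this README.

    Only --backend, -m and -f are used; no other flags are assumed to exist.
    """
    cmd = ["crispasr", "--backend", backend]
    if model_path is not None:
        # Explicit local GGUF file.
        cmd += ["-m", model_path]
    cmd += ["-f", audio_path]
    return cmd


def transcribe(audio_path: str,
               model_path: Optional[str] = None,
               backend: str = "parakeet") -> str:
    """Run crispasr and return its standard output.

    Assumption: the transcript is printed to stdout. Raises
    subprocess.CalledProcessError on a non-zero exit code.
    """
    result = subprocess.run(
        build_crispasr_command(audio_path, model_path, backend),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

For the auto-download form, omit the model path and pass the model name as the backend, for example `build_crispasr_command("audio.wav", backend="parakeet-rnnt-0.6b")`.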
Architecture
- Model: standard RNN-Transducer (no TDT duration head)
- Encoder: 24-layer FastConformer, 80-mel input, d_model=1024
- Decoder: RNN predictor (hidden=640) + joint network (hidden=640)
- Vocab: 1024-token BPE (lowercase English)
- Sample rate: 16 kHz
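Since the model works on 16 kHz audio, it can help to check a WAV file's sample rate before transcribing. The following is a minimal sketch using Python's standard `wave` module. Whether CrispASR resamples other rates internally is not stated here, so treat the check as a precaution; `silent_wav` is a hypothetical helper that exists only to make the demo self-contained.

```python
import io
import wave

EXPECTED_RATE = 16_000  # from the "Sample rate: 16 kHz" entry above


def wav_sample_rate(source) -> int:
    """Return the sample rate of a PCM WAV file (path string or binary file object)."""
    with wave.open(source, "rb") as wav:
        return wav.getframerate()


def matches_model_rate(source) -> bool:
    """True if the file is already at the model's 16 kHz sample rate."""
    return wav_sample_rate(source) == EXPECTED_RATE


def silent_wav(rate: int, n_frames: int = 160) -> io.BytesIO:
    """Hypothetical demo helper: an in-memory mono 16-bit clip of silence."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * n_frames)
    buf.seek(0)  # rewind so the buffer can be read back
    return buf


print(matches_model_rate(silent_wav(16_000)))
print(matches_model_rate(silent_wav(44_100)))
```

In practice, pass the path of the recording instead of the in-memory buffer, for example `matches_model_rate("audio.wav")`.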
The RNNT decoder is auto-detected at runtime via n_tdt_durations==0.
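To illustrate what "standard RNN-Transducer" means in practice, here is a simplified sketch of greedy RNN-T decoding together with the detection rule stated above. It is not CrispASR's implementation: the blank id, the callables `predictor_step` and `joint`, and the per-frame symbol cap are all assumptions made for illustration. The key difference from TDT is that a blank always advances by exactly one encoder frame, because there is no duration head to predict a larger skip.

```python
from typing import Any, Callable, List, Sequence

BLANK = 0  # assumption: blank token id; the real id depends on the model


def decoder_kind(n_tdt_durations: int) -> str:
    """Mirror the rule above: zero duration bins means plain RNN-T."""
    return "rnnt" if n_tdt_durations == 0 else "tdt"


def greedy_rnnt_decode(
    encoder_frames: Sequence[Any],
    predictor_step: Callable[[Any, int], Any],      # (state, token) -> new state
    joint: Callable[[Any, Any], Sequence[float]],   # (frame, state) -> logits
    initial_state: Any,
    max_symbols_per_frame: int = 10,
) -> List[int]:
    """Greedy RNN-T decoding: emit tokens until blank, then move one frame on."""
    tokens: List[int] = []
    state = initial_state
    for frame in encoder_frames:
        # Cap the emissions per frame so a misbehaving joint cannot loop forever.
        for _ in range(max_symbols_per_frame):
            logits = joint(frame, state)
            best = max(range(len(logits)), key=logits.__getitem__)
            if best == BLANK:
                break  # blank: advance to the next encoder frame
            tokens.append(best)
            state = predictor_step(state, best)  # predictor sees the new token
    return tokens


# Toy stand-ins for the real networks: each non-zero "frame" names the token
# to emit once, and the predictor state is simply the last emitted token.
def toy_joint(frame: int, state: int) -> List[float]:
    logits = [0.0] * 4
    target = frame if (frame != 0 and state != frame) else BLANK
    logits[target] = 1.0
    return logits


def toy_predictor_step(state: int, token: int) -> int:
    return token


print(decoder_kind(0))
print(greedy_rnnt_decode([1, 2, 0, 3], toy_predictor_step, toy_joint, -1))
```

With the toy networks, each non-zero frame contributes one token and the all-blank frame contributes none, which is the behavior the inner loop is meant to show.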
License
nvidia/parakeet-rnnt-0.6b is released under the CC BY 4.0 license.