parakeet-rnnt-0.6b โ€” GGUF

GGUF conversions of nvidia/parakeet-rnnt-0.6b for use with CrispASR.

Files

File Size Quantization
parakeet-rnnt-0.6b-q4_k.gguf ~447 MB Q4_K (recommended)
parakeet-rnnt-0.6b-f16.gguf ~1.2 GB F16 (full precision)

Usage

crispasr --backend parakeet \
  -m parakeet-rnnt-0.6b-q4_k.gguf \
  -f audio.wav

Or let CrispASR auto-download:

crispasr --backend parakeet-rnnt-0.6b -f audio.wav

Architecture

  • Model: standard RNN-Transducer (no TDT duration head)
  • Encoder: 24-layer FastConformer, 80-mel input, d_model=1024
  • Decoder: RNN predictor (hidden=640) + joint network (hidden=640)
  • Vocab: 1024-token BPE (lowercase English)
  • Sample rate: 16 kHz

The RNNT decoder is auto-detected at runtime via n_tdt_durations==0.

License

nvidia/parakeet-rnnt-0.6b is released under the CC BY 4.0 license.

Downloads last month
160
GGUF
Model size
0.6B params
Architecture
parakeet
Hardware compatibility
Log In to add your hardware

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for cstr/parakeet-rnnt-0.6b-GGUF

Quantized
(8)
this model