Distil Whisper Large v3 (ggml)

ggml conversion of distil-whisper/distil-large-v3 for use with CrispASR and whisper.cpp.

Model Details

  • Architecture: Whisper encoder (32 layers, 1280-dim) + distilled decoder (2 layers only)
  • Parameters: 756M (49% smaller than whisper-large-v3)
  • Speed: 6.3x faster than whisper-large-v3, with WER within 1% of it
  • Language: English
  • License: MIT

Usage

# Uses the standard whisper backend (auto-detected)
crispasr -m distil-large-v3-q5_0.bin -f audio.wav
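
whisper.cpp-based tools have traditionally expected 16 kHz mono 16-bit PCM WAV input. Whether CrispASR converts other formats on its own is not stated here, so the following is only a sketch of a pre-conversion step, assuming ffmpeg is installed. The helper name to_wav16k and the file talk.mp3 are hypothetical.

```shell
# Hypothetical helper: convert any audio file to 16 kHz mono 16-bit PCM WAV.
# Assumes ffmpeg is on PATH; the flags used are standard ffmpeg options.
to_wav16k() {
    # $1 = input audio file, $2 = output WAV path
    ffmpeg -y -i "$1" -ar 16000 -ac 1 -c:a pcm_s16le "$2"
}

# Example (not run here; talk.mp3 is a placeholder file name):
#   to_wav16k talk.mp3 audio.wav
#   crispasr -m distil-large-v3-q5_0.bin -f audio.wav
```

The converted audio.wav can then be passed to the command shown above. Recent whisper.cpp builds typically take the same -m and -f flags on their whisper-cli binary.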

Files

File                       Size     JFK Result
distil-large-v3.bin        1.5 GB   perfect
distil-large-v3-q5_0.bin   513 MB   perfect
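
As a rough sanity check on these sizes, a file's size can be estimated as parameters times bits per weight divided by 8. The sketch below assumes the 756M parameter count listed above, 16 bits per weight for the unquantized file, and 5.5 bits per weight for ggml q5_0 (32 weights packed into a 22-byte block). The helper name estimate_mb is hypothetical. The estimate ignores file metadata and any tensors stored at a different precision, so it is only a ballpark.

```shell
# Hypothetical helper: estimate model file size in MB (10^6 bytes).
# size_MB = parameters_in_millions * bits_per_weight / 8
estimate_mb() {
    # $1 = parameter count in millions, $2 = bits per weight
    awk -v p="$1" -v b="$2" 'BEGIN { printf "%.0f\n", p * b / 8 }'
}

estimate_mb 756 16    # 16-bit weights (assumed for distil-large-v3.bin)
estimate_mb 756 5.5   # q5_0: 22 bytes per 32 weights = 5.5 bits per weight
```

Under these assumptions the estimates come to about 1512 MB and 520 MB, in the same range as the listed 1.5 GB and 513 MB.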

Why Distil Whisper?

  • 6.3x faster than whisper-large-v3 (2 decoder layers vs 32)
  • WER within 1% of whisper-large-v3 on standard benchmarks
  • Same encoder as whisper-large-v3 (32 layers, 1280-dim)
  • Drop-in replacement: same ggml format, same CLI flags