Distil Whisper Large v3 (ggml)

Model Details

Architecture: Whisper encoder (32 layers, 1280-dim) + distilled decoder (2 layers only)
Parameters: 756M (49% smaller than whisper-large-v3)
Speed: 6.3x faster than whisper-large-v3, with WER within 1% of whisper-large-v3
Language: English
License: MIT

# Uses the standard whisper backend (auto-detected)
crispasr -m distil-large-v3-q5_0.bin -f audio.wav
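
For scripted or batch use, the command above can be wrapped in a few lines of Python. This is only a minimal sketch: it assumes `crispasr` is on the PATH and prints the transcript to standard output (that output behaviour is an assumption, not something stated here). `build_command` and `transcribe` are hypothetical helper names, and only the `-m` and `-f` flags shown above are used.

```python
import shutil
import subprocess

MODEL = "distil-large-v3-q5_0.bin"  # model file from the command above


def build_command(model: str, audio: str) -> list[str]:
    """Return the argv list equivalent to: crispasr -m MODEL -f AUDIO."""
    return ["crispasr", "-m", model, "-f", audio]


def transcribe(audio: str, model: str = MODEL) -> str:
    """Run crispasr on one file and return whatever it prints to stdout.

    Assumption: the transcript is written to stdout.
    """
    if shutil.which("crispasr") is None:
        raise FileNotFoundError("crispasr not found on PATH")
    result = subprocess.run(
        build_command(model, audio),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout.strip()
```

With the binary and model in place, `print(transcribe("audio.wav"))` would run the same invocation as the command line above.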

File	Size	JFK Result
distil-large-v3.bin	1.5 GB	perfect
distil-large-v3-q5_0.bin	513 MB	perfect
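
The two file sizes can be roughly sanity-checked from the parameter count. The sketch below assumes the unquantized file stores weights as 16-bit floats (not stated here) and uses the ggml q5_0 block layout of 22 bytes per 32 weights. `estimate_mb` is a hypothetical helper, and the estimate ignores file metadata and any tensors kept at a different precision, so it is approximate.

```python
PARAMS = 756e6  # parameter count from Model Details

# Bytes per weight for each storage format.
F16_BYTES = 2.0        # assumption: one 16-bit float per weight
Q5_0_BYTES = 22 / 32   # q5_0 block: 2-byte scale + 4 bytes of high bits
                       # + 16 bytes of low nibbles, covering 32 weights


def estimate_mb(params: float, bytes_per_weight: float) -> float:
    """Estimated size in MB (10^6 bytes), weights only."""
    return params * bytes_per_weight / 1e6


f16_mb = estimate_mb(PARAMS, F16_BYTES)
q5_mb = estimate_mb(PARAMS, Q5_0_BYTES)
print(f"f16:  ~{f16_mb:.0f} MB")
print(f"q5_0: ~{q5_mb:.0f} MB")
```

Under these assumptions the estimates come out at about 1512 MB and 520 MB, in line with the 1.5 GB and 513 MB listed in the table.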

