Distil Whisper Large v3 (ggml)
ggml conversion of distil-whisper/distil-large-v3 for use with CrispASR and whisper.cpp.
Model Details
- Architecture: Whisper encoder (32 layers, 1280-dim) + distilled decoder (2 layers only)
- Parameters: 756M (49% smaller than whisper-large-v3)
- Speed: 6.3x faster than whisper-large-v3, within 1% WER
- Language: English
- License: MIT
Usage
# Uses the standard whisper backend (auto-detected)
crispasr -m distil-large-v3-q5_0.bin -f audio.wav
Files
| File | Size | JFK Result |
|---|---|---|
| distil-large-v3.bin | 1.5 GB | perfect |
| distil-large-v3-q5_0.bin | 513 MB | perfect |
Why Distil Whisper?
- 6.3x faster than whisper-large-v3 (2 decoder layers vs 32)
- Within 1% WER on standard benchmarks
- Same encoder as whisper-large-v3 (32 layers, 1280-dim)
- Drop-in replacement โ same ggml format, same CLI flags
Model tree for cstr/distil-large-v3-GGUF
Base model
distil-whisper/distil-large-v3