Whisper Tiny (English) · OpenASR

The smallest English-only Whisper, fastest for English speech

Native speech-to-text in the OpenASR runtime — engineered for peak performance on CPU & GPU, no Python at inference time.

✨ Highlights

🇬🇧 English-only — specialized for English, typically more accurate on English than the same-size multilingual model
⚡ 39M parameters — the smallest, fastest, and lightest Whisper checkpoint
🌐 Weak-supervision scale — trained with Whisper's 680k-hour labelled speech corpus
🦀 Native in OpenASR — .oasr packs run with no Python at inference, engineered for peak performance on CPU & GPU

🚀 Quickstart

# 1. Install the OpenASR CLI  ·  https://openasr.org
# 2. Pull a build (pick a quant — see the table below)
openasr pull whisper-tiny.en:q8

# 3. Transcribe
openasr transcribe audio.wav --model whisper-tiny.en

All builds for this model:

openasr pull whisper-tiny.en:fp16
openasr pull whisper-tiny.en:q8
openasr pull whisper-tiny.en:q4
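
For scripted use, the two commands above can be wrapped from Python. The following is a minimal sketch, not part of OpenASR: the helper names (`pull_cmd`, `transcribe_cmd`, `transcribe_all`) are hypothetical, and it only assembles the `openasr pull` and `openasr transcribe` invocations exactly as shown in the Quickstart. How the CLI reports its transcript is not described here, so the output is passed through to the terminal rather than parsed.

```python
import shutil
import subprocess
from pathlib import Path

MODEL = "whisper-tiny.en"  # model id used in the Quickstart


def pull_cmd(tag: str = "q8") -> list:
    """Build the documented `openasr pull <model>:<tag>` command."""
    return ["openasr", "pull", f"{MODEL}:{tag}"]


def transcribe_cmd(audio: Path) -> list:
    """Build the documented `openasr transcribe <file> --model <model>` command."""
    return ["openasr", "transcribe", str(audio), "--model", MODEL]


def transcribe_all(folder: Path) -> int:
    """Run the CLI once per .wav file in `folder`; return how many files were processed.

    Returns 0 without doing anything if the `openasr` binary is not on PATH.
    """
    if shutil.which("openasr") is None:
        return 0
    subprocess.run(pull_cmd(), check=True)
    count = 0
    for wav in sorted(folder.glob("*.wav")):
        # CLI output goes straight to the terminal; nothing is parsed here.
        subprocess.run(transcribe_cmd(wav), check=True)
        count += 1
    return count
```

Calling `transcribe_all(Path("recordings"))` would pull the q8 build and then transcribe each `.wav` file in a (hypothetical) `recordings` folder in sorted order.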

📦 Available builds

Quant	File (`.oasr`)	Size	RAM peak	RTF · M1 CPU	RTF · M1 GPU	JFK ΔWER vs fp16
fp16	`whisper-tiny.en-fp16.oasr`	79 MB	325 MB	0.05×	0.05×	0.0%
q8_0	`whisper-tiny.en-q8_0.oasr`	63 MB	277 MB	0.05×	0.04×	0.0%
q4_k	`whisper-tiny.en-q4_k.oasr`	61 MB	271 MB	0.05×	0.05×	0.0%

RTF = real-time factor on the fixed 11 s JFK clip (processing time divided by audio duration; lower is faster). RAM peak is measured per pack in an isolated subprocess. JFK ΔWER compares each quantized build's JFK transcript to this model's fp16 JFK transcript, so it measures quantization drift rather than absolute recognition accuracy. q8_0 is the recommended default: on this clip it matches the fp16 transcript (0.0% ΔWER) with a smaller file (63 MB vs 79 MB) and lower peak RAM (277 MB vs 325 MB).
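
The two metrics in the table can be reproduced with a few lines of Python. This is a sketch under stated assumptions: RTF is taken as processing time divided by audio duration, and ΔWER is read as the word error rate of a quantized build's transcript scored against the fp16 transcript as reference. The exact text normalization used for the published numbers is not documented here, and the example strings are illustrative placeholders, not actual model output.

```python
def real_time_factor(processing_s: float, audio_s: float) -> float:
    """RTF = processing time / audio duration (lower is faster)."""
    if audio_s <= 0:
        raise ValueError("audio duration must be positive")
    return processing_s / audio_s


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference word count."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] = edit distance between the ref words seen so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cur.append(min(
                prev[j] + 1,             # deletion of a reference word
                cur[j - 1] + 1,          # insertion of a hypothesis word
                prev[j - 1] + (r != h),  # substitution, or match at no cost
            ))
        prev = cur
    return prev[-1] / len(ref)


# Placeholder transcripts standing in for the fp16 and quantized outputs.
fp16_text = "ask not what your country can do for you"
quant_text = "ask not what your country can do for you"

print(f"{real_time_factor(0.55, 11.0):.2f}x")           # prints 0.05x
print(f"{word_error_rate(fp16_text, quant_text):.1%}")  # prints 0.0%
```

Read the other way, an RTF of 0.05× on the 11 s clip corresponds to roughly 0.55 s of processing time.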

🧠 About Whisper Tiny (English)

Whisper Tiny.en is OpenAI's 39M-parameter English-only Whisper checkpoint. It uses the standard Whisper encoder-decoder architecture for automatic speech recognition, trained with large-scale weak supervision on 680k hours of labelled speech. As an English-specialized model it tends to outperform the same-size multilingual Whisper on English audio, at the lowest footprint and fastest inference in the family. This OpenASR repo repackages the original openai/whisper-tiny.en weights as .oasr packs that run natively in the OpenASR runtime with no Python at inference time. For most users the q8_0 build is the recommended default; q4_k is for the tightest memory budgets and fp16 is for verification or maximum fidelity.
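
The build recommendation above can be written down as a small selection rule. This is a hypothetical helper, not an OpenASR API: `pick_build` and its preference order are assumptions that encode the guidance in the text (q8_0 by default, q4_k when memory is tight, fp16 only on request), and the RAM figures are copied from the builds table, so they reflect that benchmark rather than every workload.

```python
from typing import Optional

# Peak RAM per build in MB, copied from the "Available builds" table.
RAM_PEAK_MB = {"fp16": 325, "q8_0": 277, "q4_k": 271}

# Preference order taken from the text: q8_0 first, then q4_k.
# fp16 is excluded from the automatic choice because it is meant for verification.
PREFERENCE = ("q8_0", "q4_k")


def pick_build(ram_budget_mb: float) -> Optional[str]:
    """Return the first preferred build whose measured peak RAM fits the budget."""
    for quant in PREFERENCE:
        if RAM_PEAK_MB[quant] <= ram_budget_mb:
            return quant
    return None  # nothing fits the budget


print(pick_build(512))  # q8_0
print(pick_build(275))  # q4_k
print(pick_build(200))  # None
```

With these numbers q4_k is only selected for budgets between 271 MB and 276 MB, since the measured peak RAM of the two quantized builds differs by just 6 MB.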

⚙️ How these packs were made

Converted from openai/whisper-tiny.en with the OpenASR importer:

openasr model-pack import-whisper-local <src> <out>.oasr \
  --package-id whisper-tiny.en --quantization {fp16,q8-0,q4-k}

The .oasr container is GGUF-backed; packs use zero-copy mmap weight binding and graph buffer reuse to keep peak memory low.
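
As a generic illustration of why mmap-backed loading keeps memory low, the sketch below maps a file read-only and views its bytes as floats without reading the whole file into a separate buffer. It is not the OpenASR implementation and says nothing about the actual `.oasr` layout: the file here is a stand-in containing four float32 values, and Python is used only to show the idea.

```python
import mmap
import os
import struct
import tempfile

# Write a small stand-in "weights" file: four float32 values (hypothetical data).
values = (0.5, -1.25, 2.0, 3.75)
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(struct.pack("4f", *values))  # native byte order, matches cast("f") below
    path = f.name

with open(path, "rb") as f, mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
    # The mapping is backed by the file; the OS loads pages lazily on first access.
    with memoryview(mm).cast("f") as view:
        # `view` reads directly from the mapped pages, with no intermediate copy.
        loaded = view.tolist()  # copied out here only so the result outlives the mapping

os.remove(path)
print(loaded)  # [0.5, -1.25, 2.0, 3.75]
```

The view is released before the mapping closes because a mapping cannot be closed while buffers exported from it are still alive.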

⚖️ License

These packs inherit the upstream model's license, Apache-2.0, as declared on the upstream Hugging Face model card. OpenASR packaging retains the upstream copyright and NOTICE; the only modifications are format conversion and quantization.

🙏 Acknowledgements

This pack is a redistribution of Whisper Tiny.en, released by OpenAI (openai/whisper-tiny.en). All credit for the original model, training recipe, and weights belongs to OpenAI. The upstream Hugging Face model card declares Apache-2.0 licensing; OpenASR only converts the weights into .oasr packages and adds quantized builds for local runtime use.

🔗 Links

🦀 OpenASR — https://github.com/QuintinShaw/openasr
🌐 Website — https://openasr.org
🤗 Upstream model — openai/whisper-tiny.en
