| --- |
| license: apache-2.0 |
| base_model: openai/whisper-base.en |
| pipeline_tag: automatic-speech-recognition |
| library_name: openasr |
| tags: |
| - automatic-speech-recognition |
| - speech-to-text |
| - openasr |
| - oasr |
| - whisper-base.en |
| --- |
| |
| <div align="center"> |
|
|
| # Whisper Base (English) Β· OpenASR |
|
|
| **Compact English-only Whisper for fast English transcription** |
|
|
| [](https://huggingface.co/openai/whisper-base.en/blob/main/README.md) |
| [](https://github.com/QuintinShaw/openasr) |
| [](https://openasr.org) |
| [](https://huggingface.co/openai/whisper-base.en) |
|
|
| Native speech-to-text in the **[OpenASR](https://github.com/QuintinShaw/openasr)** runtime β |
| engineered for peak performance on CPU & GPU, **no Python at inference time**. |
|
|
| </div> |
|
|
| --- |
|
|
| ## β¨ Highlights |
|
|
| - π¬π§ **English-only** β specialized for English, typically more accurate on English than the same-size multilingual model |
| - πͺΆ **74M parameters** β small and fast, a clear step up from tiny.en |
| - π **Weak-supervision scale** β trained with Whisper's 680k-hour labelled speech corpus |
| - π¦ **Native in OpenASR** β `.oasr` packs run with no Python at inference, engineered for peak performance on CPU & GPU |
|
|
| ## π Quickstart |
|
|
| ```bash |
| # 1. Install the OpenASR CLI Β· https://openasr.org |
| # 2. Pull a build (pick a quant β see the table below) |
| openasr pull whisper-base.en:q8 |
| |
| # 3. Transcribe |
| openasr transcribe audio.wav --model whisper-base.en |
| ``` |
|
|
| All builds for this model: |
|
|
| ```bash |
| openasr pull whisper-base.en:fp16 |
| openasr pull whisper-base.en:q8 |
| openasr pull whisper-base.en:q4 |
| ``` |
|
|
| ## π¦ Available builds |
|
|
| | Quant | File (`.oasr`) | Size | RAM peak | RTF Β· M1 CPU | RTF Β· M1 GPU | JFK ΞWER vs fp16 | |
| |:------|:---------------|-----:|---------:|-------------:|-------------:|-----------------:| |
| | fp16 | `whisper-base.en-fp16.oasr` | 149 MB | 542 MB | 0.06Γ | 0.07Γ | 0.0% | |
| | q8_0 | `whisper-base.en-q8_0.oasr` | 108 MB | 402 MB | 0.05Γ | 0.05Γ | 0.0% | |
| | q4_k | `whisper-base.en-q4_k.oasr` | 86 MB | 369 MB | 0.05Γ | 0.07Γ | 0.0% | |
|
|
| <sub>RTF = real-time factor on the fixed 11s JFK clip (**lower is faster**); RAM peak measured per pack |
| in an isolated subprocess. JFK ΞWER compares each quantized build's JFK transcript to this model's |
| fp16 JFK transcript, so it measures quantization drift rather than absolute recognition accuracy. |
| **q8_0** is the recommended default β near-reference quality at a fraction of the |
| footprint.</sub> |
| |
| ## π§ About Whisper Base (English) |
| |
| Whisper Base.en is OpenAI's 74M-parameter English-only Whisper checkpoint. It uses the standard |
| Whisper encoder-decoder architecture for automatic speech recognition, trained with large-scale |
| weak supervision on 680k hours of labelled speech. As an English-specialized model it tends to |
| outperform the same-size multilingual Whisper on English audio, while staying small and fast. |
| This OpenASR repo repackages the original `openai/whisper-base.en` weights as `.oasr` packs that |
| run natively in the OpenASR runtime with no Python at inference time. For most users the q8_0 |
| build is the recommended default; q4_k is for tighter memory budgets and fp16 is for |
| verification or maximum fidelity. |
| |
| ## βοΈ How these packs were made |
| |
| Converted from [openai/whisper-base.en](https://huggingface.co/openai/whisper-base.en) with the OpenASR importer: |
| |
| ```bash |
| openasr model-pack import-whisper-local <src> <out>.oasr \ |
| --package-id whisper-base.en --quantization {fp16,q8-0,q4-k} |
| ``` |
| |
| The `.oasr` container is GGUF-backed; packs use zero-copy mmap weight binding and graph |
| buffer reuse to keep peak memory low. |
| |
| ## βοΈ License |
| |
| These packs **inherit the upstream model's license: Apache-2.0** |
| ([source](https://huggingface.co/openai/whisper-base.en/blob/main/README.md)). OpenASR packaging retains the upstream copyright and |
| NOTICE; the only modifications are format conversion and quantization. |
| |
| ## π Acknowledgements |
| |
| This pack is a redistribution of **Whisper Base.en**, released by **OpenAI** |
| ([openai/whisper-base.en](https://huggingface.co/openai/whisper-base.en)). |
| All credit for the original model, training recipe, and weights belongs to OpenAI. The |
| upstream Hugging Face model card declares Apache-2.0 licensing; OpenASR only converts the |
| weights into `.oasr` packages and adds quantized builds for local runtime use. |
| |
| ## π Links |
| |
| - π¦ **OpenASR** β <https://github.com/QuintinShaw/openasr> |
| - π **Website** β <https://openasr.org> |
| - π€ **Upstream model** β [openai/whisper-base.en](https://huggingface.co/openai/whisper-base.en) |
| |