publish whisper-base.en OpenASR packs

94494d8 verified 4 days ago

4.79 kB

	---
	license: apache-2.0
	base_model: openai/whisper-base.en
	pipeline_tag: automatic-speech-recognition
	library_name: openasr
	tags:
	- automatic-speech-recognition
	- speech-to-text
	- openasr
	- oasr
	- whisper-base.en
	---

	<div align="center">

	# Whisper Base (English) · OpenASR

	Compact English-only Whisper for fast English transcription

	[![License](https://img.shields.io/badge/license-Apache--2.0-2563eb.svg)](https://huggingface.co/openai/whisper-base.en/blob/main/README.md)
	[![Format](https://img.shields.io/badge/format-.oasr-7c3aed.svg)](https://github.com/QuintinShaw/openasr)
	[![Runtime](https://img.shields.io/badge/runtime-OpenASR-111827.svg)](https://openasr.org)
	[![Base model](https://img.shields.io/badge/base-whisper--base.en-f59e0b.svg)](https://huggingface.co/openai/whisper-base.en)

	Native speech-to-text in the [OpenASR](https://github.com/QuintinShaw/openasr) runtime —
	engineered for peak performance on CPU & GPU, no Python at inference time.

	</div>

	---

	## ✨ Highlights

	- 🇬🇧 English-only — specialized for English, typically more accurate on English than the same-size multilingual model
	- 🪶 74M parameters — small and fast, a clear step up from tiny.en
	- 🌐 Weak-supervision scale — trained with Whisper's 680k-hour labelled speech corpus
	- 🦀 Native in OpenASR — `.oasr` packs run with no Python at inference, engineered for peak performance on CPU & GPU

	## 🚀 Quickstart

	```bash
	# 1. Install the OpenASR CLI · https://openasr.org
	# 2. Pull a build (pick a quant — see the table below)
	openasr pull whisper-base.en:q8

	# 3. Transcribe
	openasr transcribe audio.wav --model whisper-base.en
	```

	All builds for this model:

	```bash
	openasr pull whisper-base.en:fp16
	openasr pull whisper-base.en:q8
	openasr pull whisper-base.en:q4
	```

	## 📦 Available builds

	\| Quant \| File (`.oasr`) \| Size \| RAM peak \| RTF · M1 CPU \| RTF · M1 GPU \| JFK ΔWER vs fp16 \|
	\|:------\|:---------------\|-----:\|---------:\|-------------:\|-------------:\|-----------------:\|
	\| fp16 \| `whisper-base.en-fp16.oasr` \| 149 MB \| 542 MB \| 0.06× \| 0.07× \| 0.0% \|
	\| q8_0 \| `whisper-base.en-q8_0.oasr` \| 108 MB \| 402 MB \| 0.05× \| 0.05× \| 0.0% \|
	\| q4_k \| `whisper-base.en-q4_k.oasr` \| 86 MB \| 369 MB \| 0.05× \| 0.07× \| 0.0% \|

	<sub>RTF = real-time factor on the fixed 11s JFK clip (lower is faster); RAM peak measured per pack
	in an isolated subprocess. JFK ΔWER compares each quantized build's JFK transcript to this model's
	fp16 JFK transcript, so it measures quantization drift rather than absolute recognition accuracy.
	q8_0 is the recommended default — near-reference quality at a fraction of the
	footprint.</sub>

	## 🧠 About Whisper Base (English)

	Whisper Base.en is OpenAI's 74M-parameter English-only Whisper checkpoint. It uses the standard
	Whisper encoder-decoder architecture for automatic speech recognition, trained with large-scale
	weak supervision on 680k hours of labelled speech. As an English-specialized model it tends to
	outperform the same-size multilingual Whisper on English audio, while staying small and fast.
	This OpenASR repo repackages the original `openai/whisper-base.en` weights as `.oasr` packs that
	run natively in the OpenASR runtime with no Python at inference time. For most users the q8_0
	build is the recommended default; q4_k is for tighter memory budgets and fp16 is for
	verification or maximum fidelity.

	## ⚙️ How these packs were made

	Converted from [openai/whisper-base.en](https://huggingface.co/openai/whisper-base.en) with the OpenASR importer:

	```bash
	openasr model-pack import-whisper-local <src> <out>.oasr \
	--package-id whisper-base.en --quantization {fp16,q8-0,q4-k}
	```

	The `.oasr` container is GGUF-backed; packs use zero-copy mmap weight binding and graph
	buffer reuse to keep peak memory low.

	## ⚖️ License

	These packs inherit the upstream model's license: Apache-2.0
	([source](https://huggingface.co/openai/whisper-base.en/blob/main/README.md)). OpenASR packaging retains the upstream copyright and
	NOTICE; the only modifications are format conversion and quantization.

	## 🙏 Acknowledgements

	This pack is a redistribution of Whisper Base.en, released by OpenAI
	([openai/whisper-base.en](https://huggingface.co/openai/whisper-base.en)).
	All credit for the original model, training recipe, and weights belongs to OpenAI. The
	upstream Hugging Face model card declares Apache-2.0 licensing; OpenASR only converts the
	weights into `.oasr` packages and adds quantized builds for local runtime use.

	## 🔗 Links

	- 🦀 OpenASR — <https://github.com/QuintinShaw/openasr>
	- 🌐 Website — <https://openasr.org>
	- 🤗 Upstream model — [openai/whisper-base.en](https://huggingface.co/openai/whisper-base.en)