ctranslate2-4you's picture
Update README.md
e1349cb verified
---
language: en
license: apache-2.0
library_name: ctranslate2
pipeline_tag: automatic-speech-recognition
tags:
- whisper
- ctranslate2
- speech-recognition
- transcription
- float32
base_model: openai/whisper-large-v3
---
# πŸ—£οΈ Whisper Large-v3 β€” CTranslate2 (`float32`)
This is [OpenAI's whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) converted to [CTranslate2](https://github.com/OpenNMT/CTranslate2) format with `float32` precision.
> [!TIP]
> Also available in other precisions:
> [`float16`](https://huggingface.co/ctranslate2-4you/whisper-large-v3-ct2-float16) Β· [`bfloat16`](https://huggingface.co/ctranslate2-4you/whisper-large-v3-ct2-bfloat16)
---
## πŸ“‹ Details
| | |
|---|---|
| **Base model** | [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) |
| **Format** | CTranslate2 |
| **Precision** | `float32` |
| **Language** | Multilingual |
| **Task** | Automatic Speech Recognition |
---
## ⚑ Quick Start
Install the inference library:
```bash
pip install whisper-s2t-reborn
```
Transcribe an audio file:
```python
import whisper_s2t
model = whisper_s2t.load_model(
model_identifier="large-v3",
compute_type="float32",
device="cuda",
)
result = model.transcribe_with_vad(
["audio.wav"],
lang_codes=["en"],
tasks=["transcribe"],
initial_prompts=[None],
batch_size=1, # increase this to significantly improve throughput
)
for segment in result[0]:
print(segment["text"])
```
> [!NOTE]
> Models are **auto-downloaded** from this repo the first time you run inference. No manual download required.
*See the [whisper-s2t-reborn](https://github.com/BBC-Esq/WhisperS2T-reborn) repository for the full list of available parameters.*
---
## πŸ“¦ All Available CTranslate2 Whisper Models
Every model below is hosted at [huggingface.co/ctranslate2-4you](https://huggingface.co/ctranslate2-4you) and works with [whisper-s2t-reborn](https://github.com/BBC-Esq/WhisperS2T-reborn).
### 🌍 Standard Whisper (Multilingual)
| Model | `float32` | `float16` | `bfloat16` |
|---|:---:|:---:|:---:|
| **tiny** | [Link](https://huggingface.co/ctranslate2-4you/whisper-tiny-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/whisper-tiny-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/whisper-tiny-ct2-bfloat16) |
| **base** | [Link](https://huggingface.co/ctranslate2-4you/whisper-base-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/whisper-base-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/whisper-base-ct2-bfloat16) |
| **small** | [Link](https://huggingface.co/ctranslate2-4you/whisper-small-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/whisper-small-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/whisper-small-ct2-bfloat16) |
| **medium** | [Link](https://huggingface.co/ctranslate2-4you/whisper-medium-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/whisper-medium-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/whisper-medium-ct2-bfloat16) |
| **large-v3** | [Link](https://huggingface.co/ctranslate2-4you/whisper-large-v3-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/whisper-large-v3-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/whisper-large-v3-ct2-bfloat16) |
### πŸ‡ΊπŸ‡Έ Whisper English-Only
| Model | `float32` | `float16` | `bfloat16` |
|---|:---:|:---:|:---:|
| **tiny.en** | [Link](https://huggingface.co/ctranslate2-4you/whisper-tiny.en-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/whisper-tiny.en-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/whisper-tiny.en-ct2-bfloat16) |
| **base.en** | [Link](https://huggingface.co/ctranslate2-4you/whisper-base.en-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/whisper-base.en-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/whisper-base.en-ct2-bfloat16) |
| **small.en** | [Link](https://huggingface.co/ctranslate2-4you/whisper-small.en-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/whisper-small.en-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/whisper-small.en-ct2-bfloat16) |
| **medium.en** | [Link](https://huggingface.co/ctranslate2-4you/whisper-medium.en-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/whisper-medium.en-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/whisper-medium.en-ct2-bfloat16) |
### ⚑ Distilled Whisper
| Model | `float32` | `float16` | `bfloat16` |
|---|:---:|:---:|:---:|
| **distil-small.en** | [Link](https://huggingface.co/ctranslate2-4you/distil-whisper-small.en-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/distil-whisper-small.en-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/distil-whisper-small.en-ct2-bfloat16) |
| **distil-medium.en** | [Link](https://huggingface.co/ctranslate2-4you/distil-whisper-medium.en-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/distil-whisper-medium.en-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/distil-whisper-medium.en-ct2-bfloat16) |
| **distil-large-v3** | [Link](https://huggingface.co/ctranslate2-4you/distil-whisper-large-v3-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/distil-whisper-large-v3-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/distil-whisper-large-v3-ct2-bfloat16) |
### πŸš€ Whisper Large-v3 Turbo
| Model | `float32` | `float16` | `bfloat16` |
|---|:---:|:---:|:---:|
| **large-v3-turbo** | [Link](https://huggingface.co/ctranslate2-4you/whisper-large-v3-turbo-ct2-float32) | [Link](https://huggingface.co/ctranslate2-4you/whisper-large-v3-turbo-ct2-float16) | [Link](https://huggingface.co/ctranslate2-4you/whisper-large-v3-turbo-ct2-bfloat16) |
---
## πŸ”— Links
- πŸ“¦ **Inference library** β€” [whisper-s2t-reborn](https://github.com/BBC-Esq/WhisperS2T-reborn)
- πŸ—οΈ **CTranslate2** β€” [github.com/OpenNMT/CTranslate2](https://github.com/OpenNMT/CTranslate2)
- 🧠 **Original model** β€” [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3)