Update README.md
Browse files
README.md
CHANGED
|
@@ -24,7 +24,7 @@ This model is a fine-tuned version of [Whisper Larg v3](https://github.com/opena
|
|
| 24 |
|
| 25 |
- **Base Model**: Whisper Large V3
|
| 26 |
- **Fine-tuned for**: Levantine Arabic (Israeli Dialect)
|
| 27 |
-
- **WER on test set**:
|
| 28 |
|
| 29 |
## Training Data
|
| 30 |
|
|
@@ -37,7 +37,7 @@ The dataset used for training and fine-tuning this model consists of approximate
|
|
| 37 |
- **Annotation**: Human-transcribed and annotated for high accuracy.
|
| 38 |
|
| 39 |
## How to Use
|
| 40 |
-
The
|
| 41 |
The model is compatible with 16kHz audio input. Ensure your files are at the same sample rate for optimal results. You can load the model as follows:
|
| 42 |
|
| 43 |
Will save a .vtt file with transcriptions and timestamps in audio_dir:
|
|
|
|
| 24 |
|
| 25 |
- **Base Model**: Whisper Large V3
|
| 26 |
- **Fine-tuned for**: Levantine Arabic (Israeli Dialect)
|
| 27 |
+
- **WER on test set**: 33%
|
| 28 |
|
| 29 |
## Training Data
|
| 30 |
|
|
|
|
| 37 |
- **Annotation**: Human-transcribed and annotated for high accuracy.
|
| 38 |
|
| 39 |
## How to Use
|
| 40 |
+
The fine-tuned model was converted using the [faster-whisper](https://github.com/SYSTRAN/faster-whisper) package, enabling inference up to 4× faster than OpenAI's Whisper.
|
| 41 |
The model is compatible with 16kHz audio input. Ensure your files are at the same sample rate for optimal results. You can load the model as follows:
|
| 42 |
|
| 43 |
Will save a .vtt file with transcriptions and timestamps in audio_dir:
|