HebArabNlpProject
/

WhisperLevantine

Automatic Speech Recognition

Model card Files Files and versions

carmi commited on May 20, 2025

Commit

dd33b1c

·

verified ·

1 Parent(s): f5b7538

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -24,7 +24,7 @@ This model is a fine-tuned version of [Whisper Larg v3](https://github.com/opena
 - **Base Model**: Whisper Large V3
 - **Fine-tuned for**: Levantine Arabic (Israeli Dialect)
-- **WER on test set**: 35%
 ## Training Data
@@ -37,7 +37,7 @@ The dataset used for training and fine-tuning this model consists of approximate
 - **Annotation**: Human-transcribed and annotated for high accuracy.
 ## How to Use
-The finetuned model was converted using [faster-whisper](https://github.com/SYSTRAN/faster-whisper) package to run up to 4 times faster than openai/whisper.
 The model is compatible with 16kHz audio input. Ensure your files are at the same sample rate for optimal results. You can load the model as follows:
 Will save a .vtt file with transcriptions and timestamps in audio_dir:

 - **Base Model**: Whisper Large V3
 - **Fine-tuned for**: Levantine Arabic (Israeli Dialect)
+- **WER on test set**: 33%
 ## Training Data
 - **Annotation**: Human-transcribed and annotated for high accuracy.
 ## How to Use
+The fine-tuned model was converted using the [faster-whisper](https://github.com/SYSTRAN/faster-whisper) package, enabling inference up to 4× faster than OpenAI's Whisper.
 The model is compatible with 16kHz audio input. Ensure your files are at the same sample rate for optimal results. You can load the model as follows:
 Will save a .vtt file with transcriptions and timestamps in audio_dir: