HebArabNlpProject/WhisperLevantine

Automatic Speech Recognition

carmi committed on May 20, 2025

Commit 3395f36 · verified · 1 Parent(s): 49f9c21

Update README.md

Files changed (1)

README.md +1 -1

@@ -26,7 +26,7 @@ This model is a fine-tuned version of [Whisper Larg v3](https://github.com/opena
 ## Training Data
-The dataset used for training and fine-tuning this model consists of approximately 2,200 hours of transcribed audio, primarily featuring Israeli Levantine Arabic, along with some general Levantine Arabic content. The data sources include:
+The dataset used for training and fine-tuning this model consists of approximately 1,200 hours of transcribed audio, primarily featuring Israeli Levantine Arabic, along with some general Levantine Arabic content. The data sources include:
 1. **Self-maintained Collection**: 1,200 hours of audio data curated by the team, covering a wide range of Israeli Levantine Arabic speech.