Update README.md
README.md
CHANGED
@@ -26,7 +26,7 @@ This model is a fine-tuned version of [Whisper Larg v3](https://github.com/opena

 ## Training Data

-The dataset used for training and fine-tuning this model consists of approximately
+The dataset used for training and fine-tuning this model consists of approximately 1,200 hours of transcribed audio, primarily featuring Israeli Levantine Arabic, along with some general Levantine Arabic content. The data sources include:

 1. **Self-maintained Collection**: 1,200 hours of audio data curated by the team, covering a wide range of Israeli Levantine Arabic speech.
