Update README.md
Browse files
README.md
CHANGED
|
@@ -15,6 +15,8 @@ metrics:
|
|
| 15 |
model-index:
|
| 16 |
- name: Sinscribe
|
| 17 |
results: []
|
|
|
|
|
|
|
| 18 |
---
|
| 19 |
|
| 20 |
# Hi, I’m Seniru Epasinghe 👋
|
|
@@ -43,11 +45,7 @@ It achieves the following results on the evaluation set:
|
|
| 43 |
Can be used for Sinhala speech to text conversions. Make sure to input noise low audio to the model, to get the best outcome.
|
| 44 |
|
| 45 |
## Training and evaluation data
|
| 46 |
-
|
| 47 |
-
Trained on custom dataset made by preprocessing, cleaning and combining below datasets -> Final Model ready dataset with 161296 rows
|
| 48 |
-
- [Multi speaket TTS dataset - Sinhala](https://www.kaggle.com/datasets/keshan/multi-speaket-tts-dataset-sinhala)
|
| 49 |
-
- [Large Sinhala ASR training dataset](https://www.kaggle.com/datasets/keshan/large-sinhala-asr-training-dataset)
|
| 50 |
-
- [sinhala-tts-dataset](https://github.com/pnfo/sinhala-tts-dataset)
|
| 51 |
|
| 52 |
Trained on above final dataset with 2 epochs on a device with below spec for 41:00:59 hours
|
| 53 |
- 16GB RAM
|
|
|
|
| 15 |
model-index:
|
| 16 |
- name: Sinscribe
|
| 17 |
results: []
|
| 18 |
+
datasets:
|
| 19 |
+
- seniruk/sinscribe-sinhala-stt
|
| 20 |
---
|
| 21 |
|
| 22 |
# Hi, I’m Seniru Epasinghe 👋
|
|
|
|
| 45 |
Can be used for Sinhala speech to text conversions. Make sure to input noise low audio to the model, to get the best outcome.
|
| 46 |
|
| 47 |
## Training and evaluation data
|
| 48 |
+
Trained on the custom dataset - [seniruk/sinscribe-sinhala-stt](https://huggingface.co/datasets/seniruk/sinscribe-sinhala-stt)
|
|
|
|
|
|
|
|
|
|
|
|
|
| 49 |
|
| 50 |
Trained on above final dataset with 2 epochs on a device with below spec for 41:00:59 hours
|
| 51 |
- 16GB RAM
|