Commit 5aafc5a (verified), committed by seniruk
1 Parent(s): 384f5be

Update README.md

Files changed (1)
  1. README.md +3 -5
README.md CHANGED
@@ -15,6 +15,8 @@ metrics:
  model-index:
  - name: Sinscribe
  results: []
+ datasets:
+ - seniruk/sinscribe-sinhala-stt
  ---
 
  # Hi, I’m Seniru Epasinghe 👋
@@ -43,11 +45,7 @@ It achieves the following results on the evaluation set:
  Can be used for Sinhala speech to text conversions. Make sure to input noise low audio to the model, to get the best outcome.
 
  ## Training and evaluation data
-
- Trained on custom dataset made by preprocessing, cleaning and combining below datasets -> Final Model ready dataset with 161296 rows
- - [Multi speaket TTS dataset - Sinhala](https://www.kaggle.com/datasets/keshan/multi-speaket-tts-dataset-sinhala)
- - [Large Sinhala ASR training dataset](https://www.kaggle.com/datasets/keshan/large-sinhala-asr-training-dataset)
- - [sinhala-tts-dataset](https://github.com/pnfo/sinhala-tts-dataset)
+ Trained on the custom dataset - [seniruk/sinscribe-sinhala-stt](https://huggingface.co/datasets/seniruk/sinscribe-sinhala-stt)
 
  Trained on above final dataset with 2 epochs on a device with below spec for 41:00:59 hours
  - 16GB RAM