UnidentifiedPerson
/

VoiceModels

Model card Files Files and versions

UnidentifiedPerson commited on Jul 26, 2024

Commit

fe3b929

·

verified ·

1 Parent(s): f537183

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -22,7 +22,10 @@ All of the datasets used to train these models are:
     2. These datasets are edited to contain the best high-quality audio of the speaker's voice with no background noise, music, silence, or any artifacts.
     3.The sample rate of all of these datasets are 44100 hz with the training using the 48k hz.
 Training:

     2. These datasets are edited to contain the best high-quality audio of the speaker's voice with no background noise, music, silence, or any artifacts.
     3.The sample rate of all of these datasets are 44100 hz with the training using the 48k hz.
+    4. For the dataset recording and extraction process sometimes it may not be 100% perfect due to background noise or music interfering and in some cases
+       I may not even reach to the 20-25 minute mark since there may be very little or no data available, as such I also reduce the number of epochs to 200 to
+       prevent overtraining and achieve the highest quality with minimal dataset length.
 Training: