Hebrew
dual_ar
Hebrew-Fish / README.md
sleeping4cat's picture
Update README.md
5b1cc79 verified
metadata
license: mit
datasets:
  - upai-inc/saspeech
  - sleeping-ai/Fish-Hebrew
language:
  - he
base_model:
  - fishaudio/fish-speech-1.5

Hebrew is fundamentally a hard language to work in the field of Natural language processing and it is also one of the underrepresented language in the field of Speech-Speech and Text-to-Speech models. Mainly boils down to limited availability of data. To explore Speech-Speech (Voice Cloning), I used Dataset to fine-tune Fish-speech 1.5 on roughly 2.5 hours of Hebrew audio on their Gold-standard subset.

I have also fixed a few bugs on Fish's fine-tuning code and created a pull-request