Observations on the audio output

by mobeidat - opened Feb 19

Feb 19

Thanks for the excellent work on this. I tested it and it works great. I have the following observations:

I tested a few examples that other models struggle with, such as 'الإعدادات' and 'لهذا', and it read them correctly.
It does not speak digits correctly. I tried both Arabic numerals and Hindi numerals. Arabic numerals are spoken slightly better than Hindi numerals, but neither is understandable.
It speaks too fast. I reduced cfg_weight to 0.1, but it is still faster than normal speech.
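
Until digits are handled by the model itself, one possible workaround for the digit issue above is to spell digits out as words before passing text to the model. The sketch below is only an illustration under stated assumptions: `spell_out_digits` is a hypothetical helper, not part of this model's API, and it reads each digit separately (so "25" becomes "two five") rather than producing a proper Arabic cardinal number. It treats "Hindi numerals" as the Unicode Arabic-Indic digits U+0660 to U+0669, which is an assumption about what was tested.

```python
import re

# Assumption: "Hindi numerals" means the Arabic-Indic digits U+0660..U+0669.
# Map each of them to the corresponding ASCII digit.
EASTERN_TO_WESTERN = {0x0660 + i: str(i) for i in range(10)}

# Arabic names for the digits 0 through 9, indexed by digit value.
DIGIT_WORDS = [
    "صفر", "واحد", "اثنان", "ثلاثة", "أربعة",
    "خمسة", "ستة", "سبعة", "ثمانية", "تسعة",
]


def spell_out_digits(text: str) -> str:
    """Replace every digit with its Arabic name, one digit at a time.

    Hypothetical preprocessing helper; it does not build full cardinal
    numbers (tens, hundreds, and so on).
    """
    normalized = text.translate(EASTERN_TO_WESTERN)

    def replace_run(match: re.Match) -> str:
        # Read a run of digits as space-separated digit names.
        return " ".join(DIGIT_WORDS[int(ch)] for ch in match.group())

    return re.sub(r"[0-9]+", replace_run, normalized)
```

Text without digits passes through unchanged, so the helper can be applied to every input before synthesis. Reading full numbers naturally would need a proper number-to-words step instead of the per-digit mapping used here.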

Network for Advancing Modern ArabicNLP & AI org Mar 15

Thanks for the feedback. We are working on a second version of the model, in which we intend to address these issues.

Omartificial-Intelligence-Space changed discussion status to closed Mar 15
