Update README.md
Browse files
README.md
CHANGED
|
@@ -23,7 +23,44 @@ It achieves the following results on the evaluation set:
|
|
| 23 |
|
| 24 |
Trained on Roman Urdu: a transliteration function was used to map standard Urdu script to Roman Urdu.
|
| 25 |
|
| 26 |
-
##
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 27 |
|
| 28 |
More information needed
|
| 29 |
|
|
|
|
| 23 |
|
| 24 |
Trained on Roman Urdu: a transliteration function was used to map standard Urdu script to Roman Urdu.
|
| 25 |
|
| 26 |
+
## Usage
|
| 27 |
+
|
| 28 |
+
|
| 29 |
+
from IPython.display import Audio
|
| 30 |
+
|
| 31 |
+
load the model:
|
| 32 |
+
model= SpeechT5ForTextToSpeech.from_pretrained("pocketmonkey/speecht5_tts_urdu")
|
| 33 |
+
|
| 34 |
+
get a speaker embedding:
|
| 35 |
+
example = dataset["test"][304]
|
| 36 |
+
speaker_embeddings = torch.tensor(example["speaker_embeddings"]).unsqueeze(0)
|
| 37 |
+
speaker_embeddings.shape
|
| 38 |
+
|
| 39 |
+
def urdu_to_roman_urdu(text):
    """Transliterate Urdu script into Roman Urdu.

    Each Urdu character is replaced by its Roman equivalent; characters
    not present in the mapping (e.g. Latin letters, digits) are passed
    through unchanged. The hamza diacritic is dropped entirely.
    """
    # str.maketrans accepts multi-character replacement strings
    # (e.g. 'ch', 'kh'), and an empty string deletes the character.
    translit_table = str.maketrans({
        'ا': 'a', 'ب': 'b', 'پ': 'p', 'ت': 't', 'ٹ': 't', 'ث': 's', 'ج': 'j', 'چ': 'ch',
        'ح': 'h', 'خ': 'kh', 'د': 'd', 'ڈ': 'd', 'ذ': 'z', 'ر': 'r', 'ڑ': 'r', 'ز': 'z',
        'ژ': 'zh', 'س': 's', 'ش': 'sh', 'ص': 's', 'ض': 'z', 'ط': 't', 'ظ': 'z', 'ع': 'a',
        'غ': 'gh', 'ف': 'f', 'ق': 'q', 'ک': 'k', 'گ': 'g', 'ل': 'l', 'م': 'm', 'ن': 'n',
        'ں': 'n', 'و': 'w', 'ہ': 'h', 'ء': 'a', 'ی': 'y', 'ے': 'e', 'آ': 'a', 'ؤ': 'o',
        'ئ': 'y', 'ٔ': '', ' ': ' ', '۔': '.', '،': ',', '؛': ';', '؟': '?', 'ھ': 'h',
    })
    return text.translate(translit_table)
|
| 51 |
+
|
| 52 |
+
text = "زندگی میں کامیابی"
|
| 53 |
+
text=urdu_to_roman_urdu(text)
|
| 54 |
+
|
| 55 |
+
inputs = processor(text=text, return_tensors="pt")
|
| 56 |
+
|
| 57 |
+
with torch.no_grad():
    spectrogram = model.generate_speech(inputs["input_ids"], speaker_embeddings)
    speech = vocoder(spectrogram)
|
| 59 |
+
|
| 60 |
+
|
| 61 |
+
Audio(speech.numpy(), rate=16000)
|
| 62 |
+
|
| 63 |
+
|
| 64 |
|
| 65 |
More information needed
|
| 66 |
|