Instructions to use maya-research/Veena with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use maya-research/Veena with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-speech", model="maya-research/Veena")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("maya-research/Veena") model = AutoModelForCausalLM.from_pretrained("maya-research/Veena") - Notebooks
- Google Colab
- Kaggle
Female speaker "agastya" generating male audio note.
First of all thank you for this great model!
I noticed that for a shorter sentence, the speaker named "agastya" produced a male-sounding voice. Here is an example of the audio output:
English example which produced male audio
text_english = "Produce intelligence to generate human voices"
audio = generate_speech(text_english, speaker="agastya")
sf.write("output_english_agastya.wav", audio, 24000)
Yes , there are some other issues with this model , it hallucinates while using some commas in the text , dont think this can be used in production
Early checkpointed model. Punctuations, numbers, names are what we are focusing on next. Also agastya is a male. Check it out in MayaResearch.ai/playground
Agastya is sometimes speaking in Male and sometimes in female voice . How is it possible technically , just curious to understand the problem