voice cloning capability
#4
by
odg123
- opened
does it support zero shot voice cloning? could you please share any reference?
what does mean by Cross-lingual Voice Cloning (Multilingual Voice Transfer) support; how that can be utilised well?
Orpheus is under-trained for zero shot voice cloning. It doesn't work too well. We would just fine-tuning on 1-5 hrs of data.
For multi-lingual output please use the following prompt. Language prefix should be users original language and transcript can be anything.
{
"eval_text_user": f"<custom_token_3><|begin_of_text|>bengali125: मुझे तो लगा वो आएगा, ஆனா அவன் வந்து full drama பண்ணிட்டான், আর শেষে আবার আমাকে দোষ দিচ্ছে <|eot_id|><custom_token_4><custom_token_5><custom_token_1>"
}
SaudxInu
changed discussion status to
closed