The model's architecture is very resource-intensive, but it still has a strong accent when cloning a voice and translating it into another language.
· Sign up or log in to comment