Scalable and Versatile 3D Generation from images
Generate speech in a cloned voice from reference audio