Modify images using prompts and diffusion inversion
Generate a 3D mesh from a single image
High-fidelity text-to-speech