Voice conversion framework based on VITS
Generate a depth map from any photo
Generate a 3D mesh model from an image