tencent/HunyuanImage-3.0
Text-to-Image β’ 83B β’ Updated β’ 995k β’ β’ 1.09k
Generate high-quality speech from text with optional voice cloning
OmniParser, turn your LLM into GUI agent
Generate depth video from input video
Audio Conditioned LipSync with Latent Diffusion Models
Generate realistic person images with new clothes or poses
Generate synchronized audio for videos from text prompts