adept/fuyu-8b
Image-to-Text • 9B • Updated • 38.2k • 1.02k
Create a talking head video from an image and video
Generate and convert voice using text and audio inputs
Generate realistic audio from text