Generate and edit images using text instructions
Generate text from uploaded or recorded audio
Conversational speech generation
Generate text and segment images using PaliGemma 2
Co-Speech Gesture Video Generation (ICLR 2025 Oral)
Discuss and provide feedback on Hugging Face Hub features
Text-to-Video