Self Forcing Wan 2.1
π₯
322
Real-time video generation
Real-time video generation
image2mesh
Generate audio from text prompts
Generate speech from text
Generate 3D models from text descriptions
Audio-Driven Multi-Person Conversational Video Generation
A Unified Framework for Image Customization
Try out Mistral's latest OCR with pdfs and images
Chat with AI assistant powered by Qwen3 model
Demo for Nanonets-OCR
Chat with MedGemma 4B, a medical variant of Gemma 3
Generate medically-informed responses using prompts
Transcribe audio to text with timestamps
Conversational speech generation
Transcribe English audio to text
Clarity AI Upscaler Reproduction
Upscale low-resolution images to high resolution
Generate custom songs from lyrics and prompts
OmniGen2: Unified Image Understanding and Generation.