Self Forcing Wan 2.1
π₯
326
Real-time video generation
Real-time video generation
coreOCR / Camel-Doc-OCR / docscopeOCR / MonkeyOCR
ultra-fast video model, LTX 0.9.8 13B distilled
Create multiple diagram types instantly from JSON!
Convert document images to structured text and data
Conversational speech generation
Generate realistic person images with new clothes or poses
Generate synchronized audio for videos from text prompts
Generate images from text prompts with customizable settings
Diffusion-based multi-modal virtual try-on pipeline demo
Edit images with scribbleβbased color and edge control