Multimodal OCR
π
407
Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
Convert and separate audio using models and TTS
Kolors Portrait to keep face identity developed with Flux
Generate images with SD3.5
Generate stunning high quality illusion artwork
Import a portrait, click to move the head!
(Tongyi Lab) ACE: All-round Creator and Editor
Framer: Interactive Frame Interpolation
Apply the motion of a video on a portrait
Generate virtual tryβon image of a person wearing a garment
Languages ru,en,zh-cn,ja,de,fr,it,pt,pl,tr,ko,nl,cs,ar,es,hu
Generate speech in a cloned voice