4M Demo
4M: Massively Multimodal Masked Modeling
Mix two images with a slider
Generate custom images with LoRA‑enhanced Stable Diffusion
Generate descriptive captions for any image
Generate passport‑ready ID photos from a portrait
Training-free personalization of diffusion models with stochastic optimal control
Segment objects in images or videos using text prompts
Generate images from Japanese text prompts