Collections
Discover the best community collections!
Collections trending this week
-
Gemini: A Family of Highly Capable Multimodal Models
Paper • 2312.11805 • Published • 49 -
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Paper • 2312.14233 • Published • 16 -
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities
Paper • 2405.18669 • Published • 12 -
Ming-Omni: A Unified Multimodal Model for Perception and Generation
Paper • 2506.09344 • Published • 31
-
ControlNet V1.1
1.18k • Generate images guided by edges, poses, sketches, and more
-
Stable Diffusion Web UI
9 • Generate images from text prompts
-
InstantID
3.58k • Generate a custom image that keeps your face identity
-
WeShopAI Virtual Try On
504 • WeShopAI Virtual Try On. Switch outfits with ease virtually.