metadata
pipeline_tag: any-to-any
library_name: transformers
tags:
- text-to-image
- image-editing
- image-understanding
- vision-language
- multimodal
- unified-model
license: mit
UniPic3-Consistency-Model
Introduction
UniPic3-Consistency-Model is a few-step image editing and multi-image composition model based on Consistency Flow Matching (CM).
The model learns a trajectory-consistent mapping from noisy latent states to clean images, enabling stable generation with strong structural consistency.
It is distilled from UniPic-3 to support fast inference (≤8 steps) while preserving composition correctness.
The model is especially suitable for scenarios requiring geometric alignment and semantic coherence, such as multi-image composition and human–object interaction (HOI).