metadata
pipeline_tag: any-to-any
library_name: transformers
tags:
  - text-to-image
  - image-editing
  - image-understanding
  - vision-language
  - multimodal
  - unified-model
license: mit

UniPic3-Consistency-Model


Introduction

UniPic3-Consistency-Model is a few-step image editing and multi-image composition model based on Consistency Flow Matching (CM).

The model learns a trajectory-consistent mapping from noisy latent states to clean images, enabling stable generation with strong structural consistency.
It is distilled from UniPic-3 to support fast inference (≤8 steps) while preserving composition correctness.
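To illustrate what a trajectory-consistent mapping and few-step sampling look like, here is a minimal toy sketch in plain Python. It is not the UniPic3 implementation or API. It assumes a 1-D Gaussian "data" distribution and a linear noise path `x_t = (1 - t) * x_0 + t * eps`, for which the mapping from any point on a trajectory to its clean endpoint has a closed form. The names `consistency_fn`, `sample`, `MU`, and `S` are hypothetical and chosen only for this example.

```python
import math
import random

# Toy "data" distribution N(MU, S^2). Hypothetical values for illustration only.
MU, S = 2.0, 0.5


def consistency_fn(x_t, t):
    """Map a noisy state x_t at time t (1 = pure noise, 0 = clean) to its
    clean endpoint. Closed form for the 1-D Gaussian toy case; a real model
    would use a trained network here instead."""
    sigma_t = math.sqrt(((1.0 - t) * S) ** 2 + t ** 2)  # std of x_t
    return MU + (S / sigma_t) * (x_t - (1.0 - t) * MU)


def sample(num_steps=8, rng=None):
    """Multistep consistency sampling: predict the clean sample, re-noise it
    to the next (lower) time, and repeat. Uses num_steps function calls."""
    rng = rng or random.Random(0)
    times = [1.0 - i / num_steps for i in range(num_steps)]  # 1.0 down to 1/num_steps
    x_t = rng.gauss(0.0, 1.0)  # start from pure noise at t = 1
    x0_hat = x_t
    for i, t in enumerate(times):
        x0_hat = consistency_fn(x_t, t)  # jump straight to a clean estimate
        if i + 1 < len(times):
            t_next = times[i + 1]
            # Re-noise the estimate along the linear path to time t_next.
            x_t = (1.0 - t_next) * x0_hat + t_next * rng.gauss(0.0, 1.0)
    return x0_hat


if __name__ == "__main__":
    rng = random.Random(0)
    draws = [sample(num_steps=8, rng=rng) for _ in range(2000)]
    print("sample mean:", sum(draws) / len(draws))
```

In this toy setting the function satisfies the boundary condition `consistency_fn(x, 0) == x`, and every point on one trajectory maps to the same endpoint. This is the property that lets a consistency model produce a usable result in a handful of steps rather than integrating the full trajectory.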

The model is especially suitable for scenarios requiring geometric alignment and semantic coherence, such as multi-image composition and human–object interaction (HOI).