sanaka87

Update README.md

afab05c verified 5 months ago

metadata

base_model:
  - showlab/show-o-w-clip-vit
datasets:
  - brivangl/midjourney-v6-llava
language:
  - en
  - zh
license: apache-2.0
pipeline_tag: text-to-image
library_name: diffusers

Show-o-RecA

A self-supervised training framework that aligns understanding and generation with modest compute, yielding zero-shot gains in both generation and editing capability.

This repository hosts the model weights for Show-o-RecA. For installation, usage instructions, and further documentation, please visit Show-o's original GitHub repository.
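As a rough sketch of fetching the weights locally before following the GitHub instructions, the snippet below uses `snapshot_download` from the `huggingface_hub` package. The repository id `sanaka87/Show-o-RecA` and the target directory name are assumptions inferred from this page, not values confirmed by the authors, so adjust them as needed.

```python
from pathlib import Path

# Assumed repository id (uploader/name as shown on this page); not confirmed by the authors.
REPO_ID = "sanaka87/Show-o-RecA"


def download_weights(local_dir: str = "Show-o-RecA") -> Path:
    """Download every file in the model repository and return the local folder."""
    # Imported lazily so the module can be loaded without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    path = snapshot_download(repo_id=REPO_ID, local_dir=local_dir)
    return Path(path)


if __name__ == "__main__":
    print(f"Weights downloaded to: {download_weights()}")
```

Loading and inference are not covered here; they depend on the Show-o codebase referenced above.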

🧠 Method

📊 Benchmarks

Model	GenEval ↑	DPGBench ↑	WISE ↑
Show-o	0.57	70.65	0.33
Show-o-RecA	0.62	75.70	0.34
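
To put the table in perspective, the short sketch below computes the absolute and relative improvement of Show-o-RecA over Show-o on each benchmark. The scores are copied from the table above; the helper name `gain` is only illustrative.

```python
# Scores copied from the benchmark table: (Show-o, Show-o-RecA).
SCORES = {
    "GenEval": (0.57, 0.62),
    "DPGBench": (70.65, 75.70),
    "WISE": (0.33, 0.34),
}


def gain(baseline: float, improved: float) -> tuple[float, float]:
    """Return (absolute gain, relative gain in percent) over the baseline."""
    delta = improved - baseline
    return delta, 100.0 * delta / baseline


for name, (baseline, improved) in SCORES.items():
    delta, pct = gain(baseline, improved)
    print(f"{name}: +{delta:.2f} absolute, +{pct:.1f}% relative")
```

Relative gains are only comparable within a benchmark, since GenEval and WISE are reported on a 0 to 1 scale while DPGBench uses a 0 to 100 scale.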

License

Show-o-RecA is licensed under the Apache 2.0 license.

✍️ Citation

If you find our work inspiring or use our codebase in your research, please consider giving the repository a star ⭐ and citing the paper:

@article{xie2025reconstruction,
  title={Reconstruction Alignment Improves Unified Multimodal Models},
  author={Xie, Ji and Darrell, Trevor and Zettlemoyer, Luke and Wang, XuDong},
  journal={arXiv preprint arXiv:2509.07295},
  year={2025}
}