File size: 6,473 Bytes
0b82cfb d92ff94 0b82cfb d92ff94 0b82cfb a81898e 0b82cfb 2c57aa9 0b82cfb d92ff94 0b82cfb d92ff94 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 |
---
license: apache-2.0
base_model:
- Yuanshi/OminiControl
---
<div align="center">
<a href="https://fotographer.ai/">
<picture>
<source media="(max-width: 424px" srcset="https://storage.googleapis.com/fotographer-cdn/app-static-assets/zen_ctrl/banner_sm.avif" type="image/avif">
<source media="(min-width: 425px" srcset="https://storage.googleapis.com/fotographer-cdn/app-static-assets/zen_ctrl/banner.avif" type="image/avif">
<source media="(max-width: 424px" srcset="https://storage.googleapis.com/fotographer-cdn/app-static-assets/zen_ctrl/banner_sm.webp" type="image/webp">
<source media="(min-width: 425px" srcset="https://storage.googleapis.com/fotographer-cdn/app-static-assets/zen_ctrl/banner.webp" type="image/webp">
<img alt="ZenCtrl Banner" src="https://storage.googleapis.com/fotographer-cdn/app-static-assets/zen_ctrl/banner.png" />
</picture>
</a>
<h1>ZenCtrl</h1>
</div>
**An all-in-one, control framework for unified visual content creation using GenAI.**
Generate multi-view, diverse-scene, and task-specific high-resolution images from a single subject imageโwithout fine-tuning.
<div align="center" style="line-height: 1;">
<a href="https://github.com/FotographerAI/ZenCtrl/tree/main" target="_blank" style="margin: 2px;" name="github_repo_link"><img src="https://img.shields.io/badge/GitHub-Repo-181717.svg" alt="GitHub Repo" style="display: inline-block; vertical-align: middle;"></a>
<!-- <a href="https://huggingface.co/spaces/YOUR_ORG/ZenCtrl" target="_blank" name="huggingface_space_link"><img src="https://img.shields.io/badge/๐ค_HuggingFace-Space-ffbd45.svg" alt="HuggingFace Space" style="display: inline-block; vertical-align: middle;"></a> -->
<a href="https://discord.com/invite/b9RuYQ3F8k" target="_blank" style="margin: 2px;" name="discord_link"><img src="https://img.shields.io/badge/Discord-Join-7289da.svg?logo=discord" alt="Discord" style="display: inline-block; vertical-align: middle;"></a>
<a href="https://fotographer.ai/" target="_blank" style="margin: 2px;" name="lp_link"><img src="https://img.shields.io/badge/Website-Landing_Page-blue" alt="LP" style="display: inline-block; vertical-align: middle;"></a>
<a href="https://x.com/FotographerAI" target="_blank" style="margin: 2px;" name="twitter_link"><img src="https://img.shields.io/twitter/follow/FotographerAI?style=social" alt="X" style="display: inline-block; vertical-align: middle;"></a>
</div>
---
## ๐ง Overview
**ZenCtrl** is a comprehensive toolkit built to tackle core challenges in image generation:
- No fine-tuning needed โ works from **a single subject image**
- Maintains **control over shape, pose, camera angle, context**
- Supports **high-resolution**, multi-scene generation
- Modular toolkit for preprocessing, control, editing, and post-processing tasks
ZenCtrl is based on OminiControl but enhanced with more fine-grained control, consistent subject preservation, and more improved and ready-to-use models. Our goal is to build an **agentic visual generation system** that can orchestrate image/video creation from **LLM-driven recipes.**
---
## ๐ฆ Github code
https://github.com/FotographerAI/ZenCtrl/tree/main
---
## ๐ Toolkit Components (coming soon)
### ๐งน Preprocessing
- Background removal
- Matting
- Reshaping
- Segmentation
### ๐ฎ Control Models
- Shape (Canny, HED, Scribble, Depth)
- Pose (OpenPose, DensePose)
- Mask control
- Camera/View control
### ๐จ Post-processing
- Deblurring
- Color fixing
- Natural blending
### โ๏ธ Editing Models
- Inpainting (removal, masked editing, replacement)
- Outpainting
- Transformation / Motion
- Relighting
---
## ๐ฏ Supported Tasks
- Background generation
- Controlled background generation
- Subject-consistent context-aware generation
- Object and subject placement (coming soon)
- In-context image/video generation (coming soon)
- Multi-object/subject merging & blending (coming soon)
- Video generation (coming soon)
---
## ๐ฆ Target Use Cases
- Product photography
- Fashion & accessory try-on
- Virtual try-on (shoes, hats, glasses, etc.)
- People & portrait control
- Illustration, animation, and ad creatives
All of these tasks can be **mixed and layered** โ ZenCtrl is designed to support real-world visual workflows with **agentic task composition**.
---
## ๐ข News
- **2025-03-26**: ๐ง First release โ model weights available on Hugging Face!
- **Coming Soon**: Source code release, Quick Start guide, Example notebooks
- **Next**: Controlled fine-grain version on our platform and API (Pro version)
- **Future**: Video generation toolkit release
## ๐ง Limitations
1. Models currently perform best with **objects**, and to some extent **humans**.
2. Resolution support is currently capped at **1024x1024** (higher quality coming soon).
3. Performance with **illustrations** is currently limited.
4. The models were **not trained on large-scale or highly diverse datasets** yet โ we plan to improve quality and variation by training on larger and more diverse datasets, especially for **illustration and stylized content**.
5. Video support and the full **agentic task pipeline** are still under development.
---
## ๐ To-do
- [x] Release early pretrained model weights for defined tasks
- [ ] Release additional task-specific models and modes
- [ ] Release open source code (coming soon)
- [ ] Release Quick Start guide and example notebooks
- [ ] Launch API access via our app and Baseten for easier deployment
- [ ] Release high-resolution models (1500ร1500+)
- [ ] Enable full toolkit integration with agent API
- [ ] Add video generation module
---
## ๐ค Join the Community
- ๐ฌ [Discord](https://discord.com/invite/b9RuYQ3F8k) โ share ideas and feedback
- ๐ [Landing Page](https://fotographer.ai)
- ๐งช [Try it now on Hugging Face Space (release on 2025/03/28 PST)](https://huggingface.co/fotographerai/zenctrl_tools/tree/main/weights)
<!-- - ๐ง [Blog]() -->
---
## ๐ค Community Collaboration
We hope to collaborate closely with the open-source community to make **ZenCtrl** a powerful and extensible toolkit for visual content creation.
Once the source code is released, we welcome contributions in training, expanding supported use cases, and developing new task-specific modules.
Our vision is to make ZenCtrl the **standard framework** for agentic, high-quality image and video generation โ built together, for everyone.
|