Commit 09b31a0 by SeeSee21 · verified · 1 Parent(s): b13f6bc

Update README.md

Files changed (1)
  1. README.md +190 -152
README.md CHANGED
@@ -1,212 +1,250 @@
  ---
- pipeline_tag: text-to-image
- tags:
- - text-to-image
- - image-generation
- - diffusion-transformer
- - dit
- - comfyui
- - checkpoint
- - safetensors
- - aio
- - all-in-one
- - ernie-image
- - ernie-image-turbo
- - bf16
- - fp8
- license: apache-2.0
- widget:
- - text: "A premium sci-fi poster with readable text, cinematic lighting, and a futuristic city skyline at night."
- - text: "A clean product advertisement for a coffee brand with readable package text, elegant layout, and warm morning light."
  ---

- # ERNIE AIO for ComfyUI
- **All-in-One Checkpoints • Base + Turbo • BF16 & FP8 • Integrated Text Encoder + VAE**

- This repository provides **AIO (All-in-One)** checkpoint versions of **ERNIE-Image** and **ERNIE-Image-Turbo** for **ComfyUI**.

- A ready-to-use **ComfyUI workflow** is included in the **`Workflow/`** folder, and the optional **Prompt Enhancer** files are included in the **`prompt-enhancer/`** folder.

- These AIO releases are designed for users who prefer a classic **single-checkpoint workflow**.
- Each checkpoint already contains the **image model**, **text encoder**, and **VAE**, so no separate text encoder or VAE loading is required.

- ### ERNIE-AIO-Base
- ![ERNIE-AIO-Base preview](./images/ernie-aio-base.png)

- ### ERNIE-AIO-Turbo
- ![ERNIE-AIO-Turbo preview](./images/ernie-aio-turbo.png)

- ## Overview
- This repository provides **AIO (All-in-One)** checkpoint versions of **ERNIE-Image** and **ERNIE-Image-Turbo** for **ComfyUI**.

- These AIO releases are designed for users who prefer a classic **single-checkpoint workflow**.
- Each checkpoint already contains the **image model**, **text encoder**, and **VAE**, so no separate text encoder or VAE loading is required.

- ## Available Files

- ### Main checkpoints
- - `ERNIE-AIO-Base-bf16.safetensors`
- - `ERNIE-AIO-Base-fp8.safetensors`
- - `ERNIE-AIO-Turbo-bf16.safetensors`
- - `ERNIE-AIO-Turbo-fp8.safetensors`

- ### Workflow
- - `Workflow/ERNIE-AIO-Upscale.json`

- ### Prompt Enhancer
- - `prompt-enhancer/` contains the optional ERNIE Prompt Enhancer files used by the workflow

- ## What “AIO” means here
- The AIO checkpoints are packaged so they can be placed directly in:

- `ComfyUI/models/checkpoints`

- and loaded with the normal **Checkpoint Loader** node in ComfyUI.

- This means you can use a very small workflow such as:

- 1. **Checkpoint Loader**
- 2. **CLIP Text Encode (Prompt)**
- 3. **Empty Latent Image**
- 4. **KSampler**
- 5. **VAE Decode**
- 6. **Save Image**

- A workflow file may also be included separately for convenience.

- ## Model Background
- These AIO releases are based on Baidu’s official **ERNIE-Image** family.

- According to the official model pages:

- - **ERNIE-Image** is an open text-to-image model built on a **single-stream Diffusion Transformer (DiT)** and paired with a lightweight **Prompt Enhancer**
- - **ERNIE-Image-Turbo** is the **distilled** variant, designed for strong fidelity in only **8 inference steps**
- - The official releases emphasize **instruction following**, **text rendering**, and **structured image generation**, making them especially interesting for posters, comics, infographics, UI-like layouts, and other text-heavy or layout-sensitive images

- ## AIO Variants

- ### ERNIE-AIO-Base
- The Base version is intended for more deliberate, stable, and higher-quality generation.

- **Recommended settings:**
- - **Steps:** 50
- - **CFG:** 4.0

- Use this version when you want stronger refinement, more careful rendering, and the full non-turbo experience.

- ### ERNIE-AIO-Turbo
- The Turbo version is intended for fast generation and quick iteration.

- **Recommended settings:**
- - **Steps:** 8
- - **CFG:** 1.0

- Use this version when you want faster turnaround while still keeping good prompt fidelity and strong visual output.

- ## Precision / Quantization Notes
- The **FP8** versions in this repository were quantized by me.

- For these FP8 AIO checkpoints:

- - **Image model:** FP8
- - **Text encoder:** FP8
- - **VAE:** FP32

- This setup was chosen to reduce size and improve practicality while keeping decoding stable.

- ## Prompt Enhancer

- This repository also includes an optional **Prompt Enhancer** in the `text encoder/` folder.

- It can be used with the included workflow to automatically expand short prompts into richer and more descriptive prompts.
- This is optional, but useful if you want more detailed prompt wording or a more guided workflow experience.

- ## Prompting
- These models work well with **natural language prompts**.

- They are especially suitable for:
- - posters
- - title cards
- - product ads
- - readable text in images
- - structured layouts
- - comics
- - infographics
- - UI-like visuals
- - realistic and stylized imagery

- For best results, describe:
- - subject
- - environment
- - composition
- - mood
- - lighting
- - camera framing
- - text that should appear in the image

- ## Example Prompt Ideas
- - A cinematic movie poster with a bold readable title at the top, dramatic lighting, and a futuristic skyline.
- - A modern coffee advertisement with readable packaging text and a premium warm studio look.
- - A clean infographic with clearly labeled sections and a structured layout.
- - A comic-style scene with readable speech bubbles and strong composition.

- ## ComfyUI Usage
- Place the checkpoints in:

- `ComfyUI/models/checkpoints`

- Then load them through **Checkpoint Loader** like a normal checkpoint.

- No separate:
- - VAE loader
- - text encoder loader
- - extra model assembly

- is required for the AIO versions.

- ## Repository Structure

- - Main AIO checkpoints for **ERNIE-AIO-Base** and **ERNIE-AIO-Turbo**
- - `Workflow/` contains the ready-to-use ComfyUI workflow JSON
- - `prompt-enhancer/` contains the optional Prompt Enhancer files used by the workflow
- - `images/` contains the preview / title images for the Base and Turbo releases

- ## Civitai Releases

- If you prefer the Civitai pages, you can also find the releases here:

- - Turbo: https://civitai.red/models/2551548?modelVersionId=2868512
- - Base: https://civitai.red/models/2551548?modelVersionId=2867462

- ## Limitations
- As with other image generation models:

- - text rendering can still vary depending on prompt complexity
- - highly dense typography may require retries
- - prompt phrasing can strongly affect layout quality
- - FP8 variants may behave slightly differently from BF16 variants depending on hardware and workflow

- ## Intended Use
- These releases are intended for:
- - local image generation in ComfyUI
- - poster and design-style generation
- - readable title images
- - structured compositions
- - users who want a simple all-in-one checkpoint workflow

- ## Credits
- Huge thanks to the original **ERNIE-Image** team at **Baidu** for releasing the official models.

- Official sources:
- - **ERNIE-Image:** https://huggingface.co/baidu/ERNIE-Image
- - **ERNIE-Image-Turbo:** https://huggingface.co/baidu/ERNIE-Image-Turbo
- - **GitHub:** https://github.com/baidu/ERNIE-Image

- ## License
- This repository is based on the official ERNIE-Image releases, whose Hugging Face model pages list the license as **Apache-2.0**.

- ## Final Note
- These AIO releases were made to simplify the ERNIE workflow for ComfyUI users.

- Load checkpoint, write prompt, sample, decode, save.

+ ---
+ pipeline_tag: text-to-image
+ tags:
+ - text-to-image
+ - image-generation
+ - diffusion-transformer
+ - dit
+ - comfyui
+ - checkpoint
+ - safetensors
+ - aio
+ - all-in-one
+ - ernie-image
+ - ernie-image-turbo
+ - bf16
+ - fp8
+ license: apache-2.0
+ widget:
+ - text: >-
+ A premium sci-fi poster with readable text, cinematic lighting, and a
+ futuristic city skyline at night.
+ - text: >-
+ A clean product advertisement for a coffee brand with readable package text,
+ elegant layout, and warm morning light.
+ language:
+ - en
+ ---
+
+ # 🖼️ ERNIE AIO for ComfyUI
+
+ **All-in-One Checkpoints • Base + Turbo • BF16 & FP8 • Text Encoder + VAE Included**
+
+ [![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
+ [![ComfyUI](https://img.shields.io/badge/ComfyUI-Compatible-green.svg)](https://github.com/comfyanonymous/ComfyUI)
+
  ---
+
+ - [📦 Models](#-models)
+ - [📋 Workflow](#-workflow)
+ - [📸 ERNIE-AIO-Base](#-ernie-aio-base)
+ - [⚡ ERNIE-AIO-Turbo](#-ernie-aio-turbo)
+ - [🔄 FP8 vs BF16](#-fp8-vs-bf16)
+ - [⚙️ Common Settings](#️-common-settings)
+ - [🗂️ Prompt Enhancer](#️-prompt-enhancer)
+ - [💡 Prompting](#-prompting)
+ - [📥 Downloads](#-downloads)
+ - [📁 Files](#-files)
+ - [🙏 Credits](#-credits)
+

  ---

+ ## 📦 Models

+ | Model | Style | FP8 | BF16 | Best For |
+ |---|---|---|---|---|
+ | 📸 **ERNIE-AIO-Base** | Realistic / Structured | ✅ | ✅ | High-quality, refined output |
+ | ⚡ **ERNIE-AIO-Turbo** | Realistic / Structured | ✅ | ✅ | Fast iteration, 8-step generation |

+ **All models:** Integrated Text Encoder + VAE • Single Checkpoint • No extra loaders needed

+ ---

+ ## 📋 Workflow

+ | File | Description |
+ |---|---|
+ | `Workflow/ERNIE-AIO-Upscale.json` | Standard ComfyUI workflow with upscaler |

+ > ✅ A ready-to-use **ComfyUI workflow** is included in the `Workflow/` folder.
+ > ✅ Optional **Prompt Enhancer** files are included in the `prompt-enhancer/` folder.

+ ---

+ ## 📸 ERNIE-AIO-Base

+ The Base version is intended for high-quality, deliberate, and stable generation.

+ **Features:** 📖 Strong instruction following • 🖋️ Text rendering • 📐 Structured layouts

+ ```
+ Steps: 50 | CFG: 4.0
+ ```

+ [![ERNIE-AIO-Base preview](https://huggingface.co/SeeSee21/ERNIE-AIO/resolve/main/images/ernie-aio-base.png)](https://huggingface.co/SeeSee21/ERNIE-AIO/blob/main/images/ernie-aio-base.png)

+ Use this version when you want:
+ - Stronger refinement and more careful rendering
+ - Full non-turbo quality
+ - Best results for posters, infographics, and text-heavy layouts

+ ---

+ ## ⚡ ERNIE-AIO-Turbo

+ The Turbo version is a distilled variant for fast generation and quick iteration.

+ **Features:** ⚡ 8-step generation • 🎯 High prompt fidelity • 🚀 Quick turnaround
+
+ ```
+ Steps: 8 | CFG: 1.0
+ ```
+
+ [![ERNIE-AIO-Turbo preview](https://huggingface.co/SeeSee21/ERNIE-AIO/resolve/main/images/ernie-aio-turbo.png)](https://huggingface.co/SeeSee21/ERNIE-AIO/blob/main/images/ernie-aio-turbo.png)

+ Use this version when you want:
+ - Fast output for prototyping and iteration
+ - Strong visual quality with minimal steps
+ - Efficient VRAM usage

+ ---

+ ## 🔄 FP8 vs BF16

+ | | 🟡 FP8 | 🌟 BF16 |
+ |---|---|---|
+ | **Quality** | ~95% | 100% |
+ | **File Size** | Smaller | Full |
+ | **Speed** | Faster | Standard |
+ | **VAE** | FP32 (stable) | FP32 (stable) |
+ | **Best For** | Daily use | Final renders |

+ > **Note:** The FP8 versions in this repository were quantized by me.
+ > VAE is kept at **FP32** in all variants for stable decoding.
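
One way to check which precision each tensor in a checkpoint actually uses is to read the `.safetensors` header, which is a length-prefixed JSON table of tensor names, dtypes, and shapes. The sketch below is a minimal illustration using only the Python standard library. The tensor names in the demo file are hypothetical and do not reflect the real key layout of the ERNIE-AIO checkpoints; the dtype strings (`F8_E4M3`, `F32`) are the labels the safetensors format uses for those types.

```python
import json
import os
import struct
import tempfile
from collections import Counter


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file as a dict."""
    with open(path, "rb") as f:
        # The file starts with an unsigned 64-bit little-endian header length.
        (header_len,) = struct.unpack("<Q", f.read(8))
        return json.loads(f.read(header_len))


def dtype_counts(path):
    """Count tensors per dtype string, skipping the optional metadata entry."""
    header = read_safetensors_header(path)
    counts = Counter(
        info["dtype"] for name, info in header.items() if name != "__metadata__"
    )
    return dict(counts)


def write_demo_file(path):
    """Write a tiny synthetic file so the sketch runs without a real checkpoint."""
    # Hypothetical tensor names, chosen only for illustration.
    header = {
        "model.weight": {"dtype": "F8_E4M3", "shape": [4], "data_offsets": [0, 4]},
        "vae.weight": {"dtype": "F32", "shape": [1], "data_offsets": [4, 8]},
    }
    blob = json.dumps(header).encode("utf-8")
    with open(path, "wb") as f:
        f.write(struct.pack("<Q", len(blob)))
        f.write(blob)
        f.write(bytes(8))  # zero-filled tensor data


def demo():
    """Create the synthetic file in a temp directory and summarize its dtypes."""
    path = os.path.join(tempfile.mkdtemp(), "demo.safetensors")
    write_demo_file(path)
    return dtype_counts(path)


if __name__ == "__main__":
    print(demo())
```

Pointing `dtype_counts` at one of the real FP8 files should, if the note above holds, show FP8 entries for the image model and text encoder alongside `F32` entries for the VAE.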

+ ---

+ ## ⚙️ Common Settings

+ ```
+ # ERNIE-AIO-Base
+ Steps: 50
+ CFG: 4.0

+ # ERNIE-AIO-Turbo
+ Steps: 8
+ CFG: 1.0

+ # Placement (both variants)
+ ComfyUI/models/checkpoints/

+ # Loader
+ Standard "Load Checkpoint" node; no extra VAE or text encoder loader needed
+ ```

+ **Minimal workflow:**
+ 1. **Checkpoint Loader**
+ 2. **CLIP Text Encode (Prompt)**
+ 3. **Empty Latent Image**
+ 4. **KSampler**
+ 5. **VAE Decode**
+ 6. **Save Image**
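
The minimal workflow above can also be written as a ComfyUI API-format graph, which is a JSON object mapping node ids to a `class_type` and its inputs. The following is a rough sketch, not the workflow shipped in `Workflow/`. The sampler (`euler`), scheduler (`simple`), 1024x1024 resolution, and empty negative prompt are assumptions, since the README does not specify them; only the step and CFG values come from the settings above. A second CLIP Text Encode node is added because KSampler requires a negative conditioning input.

```python
import json

# Step and CFG values as listed in the README for each variant.
SETTINGS = {
    "base": {"steps": 50, "cfg": 4.0},
    "turbo": {"steps": 8, "cfg": 1.0},
}


def build_workflow(ckpt_name, prompt, steps, cfg, negative="", seed=0,
                   width=1024, height=1024):
    """Build an API-format graph: load, encode, sample, decode, save."""
    return {
        # Output slots of the loader: 0 = MODEL, 1 = CLIP, 2 = VAE.
        "1": {"class_type": "CheckpointLoaderSimple",
              "inputs": {"ckpt_name": ckpt_name}},
        "2": {"class_type": "CLIPTextEncode",
              "inputs": {"text": prompt, "clip": ["1", 1]}},
        "3": {"class_type": "CLIPTextEncode",
              "inputs": {"text": negative, "clip": ["1", 1]}},
        "4": {"class_type": "EmptyLatentImage",
              "inputs": {"width": width, "height": height, "batch_size": 1}},
        "5": {"class_type": "KSampler",
              "inputs": {"model": ["1", 0],
                         "positive": ["2", 0],
                         "negative": ["3", 0],
                         "latent_image": ["4", 0],
                         "seed": seed,
                         "steps": steps,
                         "cfg": cfg,
                         "sampler_name": "euler",   # assumption
                         "scheduler": "simple",     # assumption
                         "denoise": 1.0}},
        "6": {"class_type": "VAEDecode",
              "inputs": {"samples": ["5", 0], "vae": ["1", 2]}},
        "7": {"class_type": "SaveImage",
              "inputs": {"images": ["6", 0], "filename_prefix": "ERNIE-AIO"}},
    }


if __name__ == "__main__":
    graph = build_workflow(
        "ERNIE-AIO-Turbo-fp8.safetensors",
        "A cinematic movie poster with a bold readable title at the top.",
        **SETTINGS["turbo"],
    )
    print(json.dumps(graph, indent=2))
```

A running ComfyUI server accepts such a graph as the `prompt` field of a JSON body posted to its `/prompt` endpoint; the host and port depend on the local setup.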

+ ---

+ ## 🗂️ Prompt Enhancer

+ An optional **Prompt Enhancer** is included in the `prompt-enhancer/` folder.

+ - Automatically expands short prompts into richer, more descriptive prompts
+ - Used by the included workflow
+ - Fully optional: the checkpoints work without it too

+ ---

+ ## 💡 Prompting

+ **Natural language works best!** These models excel at instruction following and text rendering.

+ ```
+ ✅ A cinematic movie poster with a bold readable title at the top,
+ dramatic lighting, a futuristic skyline in the background,
+ and a heroic figure silhouetted against a neon-lit city.

+ ❌ movie poster, city, hero, cinematic, dramatic
+ ```

+ **ERNIE-Image is especially strong for:**

+ | Category | Examples |
+ |---|---|
+ | 🎬 Posters & Title Cards | Movie posters, event flyers, title images |
+ | ☕ Product Ads | Coffee brands, packaging with readable text |
+ | 📊 Infographics | Labeled sections, structured layouts |
+ | 💬 Comics | Speech bubbles, panel compositions |
+ | 🖥️ UI-style Visuals | Interface mockups, app screens |
+ | 🖼️ Realistic Imagery | Portraits, environments, product shots |

+ **For best results, describe:**
+ - Subject & environment
+ - Composition & layout
+ - Mood & lighting
+ - Camera framing
+ - Any text that should appear in the image
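
The checklist above can be turned into a small helper that assembles one natural-language prompt from its parts. This is only an illustrative sketch: the function name, its parameters, and the convention of putting the desired image text in double quotes are assumptions, not something the README or the ERNIE-Image documentation prescribes.

```python
def compose_prompt(subject, environment="", composition="", mood="",
                   lighting="", framing="", text=""):
    """Join the non-empty parts into a single descriptive sentence."""
    parts = (subject, environment, composition, mood, lighting, framing)
    clauses = [p.strip() for p in parts if p.strip()]
    if text.strip():
        # Assumed convention: quote the exact wording that should be rendered.
        clauses.append(f'with the text "{text.strip()}" clearly readable')
    return ", ".join(clauses) + "."


if __name__ == "__main__":
    print(compose_prompt(
        "A cinematic movie poster",
        environment="a futuristic skyline in the background",
        lighting="dramatic lighting",
        text="NEON CITY",
    ))
```

Keeping each part a short descriptive phrase, rather than a single keyword, stays closer to the sentence-style prompt recommended above.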

+ ---

+ ## 📥 Downloads

+ ### ERNIE-AIO Checkpoints

+ | Model | Precision | CivitAI |
+ |---|---|---|
+ | ⚡ **ERNIE-AIO-Turbo** | 🌟 BF16 | [Download](https://civitai.red/models/2551548?modelVersionId=2868512) |
+ | ⚡ **ERNIE-AIO-Turbo** | 🟡 FP8 | [Download](https://civitai.red/models/2551548?modelVersionId=2868512) |
+ | 📸 **ERNIE-AIO-Base** | 🌟 BF16 | [Download](https://civitai.red/models/2551548?modelVersionId=2867462) |
+ | 📸 **ERNIE-AIO-Base** | 🟡 FP8 | [Download](https://civitai.red/models/2551548?modelVersionId=2867462) |

+ > All files are also available directly on this HuggingFace page.
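
For scripted downloads from the Hugging Face side, the direct file URLs and the ComfyUI target paths can be derived from the file names. The sketch below assumes the repository id `SeeSee21/ERNIE-AIO` (taken from the preview image links in this README), that the checkpoints sit at the repository root on the `main` branch, and a ComfyUI install folder named `ComfyUI`. It only builds strings and does not download anything.

```python
from pathlib import PurePosixPath

REPO_ID = "SeeSee21/ERNIE-AIO"  # assumption: taken from the preview image URLs

CHECKPOINTS = [
    "ERNIE-AIO-Base-bf16.safetensors",
    "ERNIE-AIO-Base-fp8.safetensors",
    "ERNIE-AIO-Turbo-bf16.safetensors",
    "ERNIE-AIO-Turbo-fp8.safetensors",
]


def resolve_url(filename, revision="main"):
    """Direct-download URL, following the resolve/ pattern used by the image links."""
    return f"https://huggingface.co/{REPO_ID}/resolve/{revision}/{filename}"


def target_path(filename, comfy_root="ComfyUI"):
    """Where the checkpoint should end up inside a ComfyUI install."""
    return str(PurePosixPath(comfy_root) / "models" / "checkpoints" / filename)


if __name__ == "__main__":
    for name in CHECKPOINTS:
        print(resolve_url(name), "->", target_path(name))
```

The same files can also be fetched with `hf_hub_download` from the `huggingface_hub` package by passing the repository id, the file name, and a `local_dir`.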

+ ---

+ ## 📁 Files
+
+ ```
+ ├── ERNIE-AIO-Base-bf16.safetensors # 📸 Base (BF16)
+ ├── ERNIE-AIO-Base-fp8.safetensors # 📸 Base (FP8)
+ ├── ERNIE-AIO-Turbo-bf16.safetensors # ⚡ Turbo (BF16)
+ ├── ERNIE-AIO-Turbo-fp8.safetensors # ⚡ Turbo (FP8)
+ │
+ ├── Workflow/
+ │   └── ERNIE-AIO-Upscale.json # 🚀 ComfyUI workflow with upscaler
+ │
+ ├── prompt-enhancer/ # 🗂️ Optional Prompt Enhancer files
+ │
+ └── images/
+     ├── ernie-aio-base.png # 📸 Base preview image
+     └── ernie-aio-turbo.png # ⚡ Turbo preview image
+ ```

+ ---

+ ## 🙏 Credits

+ **Original Model:** Baidu ERNIE-Image Team • Single-Stream DiT • Apache 2.0

+ **AIO Conversion & FP8 Quantization:** [SeeSee21](https://huggingface.co/SeeSee21)

+ **Official Sources:**

+ | Resource | Link |
+ |---|---|
+ | ERNIE-Image (HuggingFace) | [baidu/ERNIE-Image](https://huggingface.co/baidu/ERNIE-Image) |
+ | ERNIE-Image-Turbo (HuggingFace) | [baidu/ERNIE-Image-Turbo](https://huggingface.co/baidu/ERNIE-Image-Turbo) |
+ | GitHub | [baidu/ERNIE-Image](https://github.com/baidu/ERNIE-Image) |

+ ---

+ **Download → Load Checkpoint → Generate!** 🚀

+ **Place in `checkpoints/`, load with "Load Checkpoint", and generate in seconds!**