starsfriday commited on
Commit
c242f3c
·
verified ·
1 Parent(s): 1bbb25e

Upload folder using huggingface_hub

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. README.md +8 -21
  3. preview/input.png +3 -0
.gitattributes CHANGED
@@ -39,3 +39,4 @@ preview/result2.png filter=lfs diff=lfs merge=lfs -text
39
  preview/result3.png filter=lfs diff=lfs merge=lfs -text
40
  preview/result4.png filter=lfs diff=lfs merge=lfs -text
41
  preview/result5.png filter=lfs diff=lfs merge=lfs -text
 
 
39
  preview/result3.png filter=lfs diff=lfs merge=lfs -text
40
  preview/result4.png filter=lfs diff=lfs merge=lfs -text
41
  preview/result5.png filter=lfs diff=lfs merge=lfs -text
42
+ preview/input.png filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -14,19 +14,6 @@ tags:
14
  - DiT
15
  - Qwen-Image
16
  - ValiantCat
17
- widget:
18
- - text: >-
19
- improve the composition and visual consistency of the image while maintaining style and realism.
20
- output:
21
- url: preview/sample1.png
22
- - text: >-
23
- enhance aesthetic appeal and global color harmony of the photo.
24
- output:
25
- url: preview/sample2.png
26
- - text: >-
27
- recompose the scene to improve perspective consistency and balance.
28
- output:
29
- url: preview/sample3.png
30
  ---
31
 
32
  <p align="center">
@@ -35,9 +22,9 @@ widget:
35
 
36
  ---
37
 
38
- # 🌈 starsfriday Qwen-Image-Edit-DiT-MeiTu
39
 
40
- This model — **Qwen-Image-Edit-DiT-MeiTu** — is an improved variant of [Qwen/Qwen-Image-Edit](https://huggingface.co/starsfriday/Qwen-Image-Edit-MeiTu), built with **DiT-based architecture fine-tuning** to enhance **visual consistency**, **aesthetic quality**, and **structural alignment** in complex edits.
41
 
42
  Developed by **Valiant Cat AI Lab**, this version aims to further close the gap between high-fidelity semantic editing and coherent artistic rendering, achieving a more natural and professional output across a wide range of prompts and subjects.
43
 
@@ -75,7 +62,7 @@ from PIL import Image
75
  from diffusers import QwenImageEditPipeline
76
 
77
  # Load the enhanced pipeline
78
- pipeline = QwenImageEditPipeline.from_pretrained("starsfriday/Qwen-Image-Edit-DiT-Enhanced")
79
  pipeline.to(torch.bfloat16)
80
  pipeline.to("cuda")
81
 
@@ -104,11 +91,11 @@ with torch.inference_mode():
104
 
105
  Below are examples of **consistency and aesthetic improvement** in complex editing scenarios:
106
 
107
- | Task | Before | After |
108
- |------|---------|-------|
109
- | **Portrait lighting enhancement** | ![](result/sample1.png) | ![](result/sample1_out.png) |
110
- | **Scene recomposition with better perspective** | ![](result/sample2.png) | ![](result/sample2_out.png) |
111
- | **Global color harmony & fine detail restoration** | ![](result/sample3.png) | ![](result/sample3_out.png) |
112
 
113
  ---
114
 
 
14
  - DiT
15
  - Qwen-Image
16
  - ValiantCat
 
 
 
 
 
 
 
 
 
 
 
 
 
17
  ---
18
 
19
  <p align="center">
 
22
 
23
  ---
24
 
25
+ # 🌈 starsfriday Qwen-Image-Edit-MeiTu
26
 
27
+ This model — **Qwen-Image-Edit-MeiTu** — is an improved variant of [Qwen/Qwen-Image-Edit](https://huggingface.co/starsfriday/Qwen-Image-Edit-MeiTu), built with **DiT-based architecture fine-tuning** to enhance **visual consistency**, **aesthetic quality**, and **structural alignment** in complex edits.
28
 
29
  Developed by **Valiant Cat AI Lab**, this version aims to further close the gap between high-fidelity semantic editing and coherent artistic rendering, achieving a more natural and professional output across a wide range of prompts and subjects.
30
 
 
62
  from diffusers import QwenImageEditPipeline
63
 
64
  # Load the enhanced pipeline
65
+ pipeline = QwenImageEditPipeline.from_pretrained("starsfriday/Qwen-Image-Edit-MeiTu")
66
  pipeline.to(torch.bfloat16)
67
  pipeline.to("cuda")
68
 
 
91
 
92
  Below are examples of **consistency and aesthetic improvement** in complex editing scenarios:
93
 
94
+ | Task | input & output
95
+ |------|---------|
96
+ | **Portrait lighting enhancement** | ![](preview/sample1.png)
97
+ | **Scene recomposition with better perspective** | ![](preview/sample2.png)|
98
+ | **Global color harmony & fine detail restoration** | ![](preview/sample3.png)
99
 
100
  ---
101
 
preview/input.png ADDED

Git LFS Details

  • SHA256: e0c61cb66e10a8b0e7f7901b8cecd759535628d86583b474af25b16316d84dd9
  • Pointer size: 132 Bytes
  • Size of remote file: 1.29 MB