zooeyy commited on
Commit
d12169b
·
verified ·
0 Parent(s):

initial commit

Browse files
Files changed (2) hide show
  1. .gitattributes +55 -0
  2. README.md +110 -0
.gitattributes ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.lz4 filter=lfs diff=lfs merge=lfs -text
12
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
13
+ *.model filter=lfs diff=lfs merge=lfs -text
14
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
15
+ *.npy filter=lfs diff=lfs merge=lfs -text
16
+ *.npz filter=lfs diff=lfs merge=lfs -text
17
+ *.onnx filter=lfs diff=lfs merge=lfs -text
18
+ *.ot filter=lfs diff=lfs merge=lfs -text
19
+ *.parquet filter=lfs diff=lfs merge=lfs -text
20
+ *.pb filter=lfs diff=lfs merge=lfs -text
21
+ *.pickle filter=lfs diff=lfs merge=lfs -text
22
+ *.pkl filter=lfs diff=lfs merge=lfs -text
23
+ *.pt filter=lfs diff=lfs merge=lfs -text
24
+ *.pth filter=lfs diff=lfs merge=lfs -text
25
+ *.rar filter=lfs diff=lfs merge=lfs -text
26
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
27
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
29
+ *.tar filter=lfs diff=lfs merge=lfs -text
30
+ *.tflite filter=lfs diff=lfs merge=lfs -text
31
+ *.tgz filter=lfs diff=lfs merge=lfs -text
32
+ *.wasm filter=lfs diff=lfs merge=lfs -text
33
+ *.xz filter=lfs diff=lfs merge=lfs -text
34
+ *.zip filter=lfs diff=lfs merge=lfs -text
35
+ *.zst filter=lfs diff=lfs merge=lfs -text
36
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
37
+ # Audio files - uncompressed
38
+ *.pcm filter=lfs diff=lfs merge=lfs -text
39
+ *.sam filter=lfs diff=lfs merge=lfs -text
40
+ *.raw filter=lfs diff=lfs merge=lfs -text
41
+ # Audio files - compressed
42
+ *.aac filter=lfs diff=lfs merge=lfs -text
43
+ *.flac filter=lfs diff=lfs merge=lfs -text
44
+ *.mp3 filter=lfs diff=lfs merge=lfs -text
45
+ *.ogg filter=lfs diff=lfs merge=lfs -text
46
+ *.wav filter=lfs diff=lfs merge=lfs -text
47
+ # Image files - uncompressed
48
+ *.bmp filter=lfs diff=lfs merge=lfs -text
49
+ *.gif filter=lfs diff=lfs merge=lfs -text
50
+ *.png filter=lfs diff=lfs merge=lfs -text
51
+ *.tiff filter=lfs diff=lfs merge=lfs -text
52
+ # Image files - compressed
53
+ *.jpg filter=lfs diff=lfs merge=lfs -text
54
+ *.jpeg filter=lfs diff=lfs merge=lfs -text
55
+ *.webp filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,110 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - text-to-image
4
+ - lora
5
+ - diffusers
6
+ - template:diffusion-lora
7
+ widget:
8
+ - output:
9
+ url: images/ComfyUI_temp_pnapt_00077_.png
10
+ text: 图1变为图2风格
11
+ - output:
12
+ url: images/ComfyUI_temp_pnapt_00074_.png
13
+ text: 图1变为图2风格
14
+ - output:
15
+ url: images/ComfyUI_temp_pnapt_00073_.png
16
+ text: 图1变为图2风格
17
+ - output:
18
+ url: images/ComfyUI_temp_pnapt_00072_.png
19
+ text: 图1变为图2风格
20
+ base_model: Qwen/Qwen-Image-Edit-2511
21
+ instance_prompt: 图1变为图2风格
22
+ license: mit
23
+ ---
24
+ # Style Transfer-Alpha0.1
25
+
26
+ <Gallery />
27
+
28
+ ## Model description
29
+
30
+ 🎨 Qwen-Image-Edit 风格模仿 LoRA 模型 v0.1(Alpha)
31
+
32
+ 这是一个基于 [Qwen-Image-Edit-2511](https:&#x2F;&#x2F;huggingface.co&#x2F;Qwen&#x2F;Qwen-Image-Edit-2511) 的实验性LoRA微调模型,专注于风格转换与艺术变换。使用400多组图像训练了22000步,rank为32,学习率为0.0001,训练分辨率为1024,并在NVIDIA RTX 4090上实现了约每迭代15秒的速度。
33
+
34
+ 🎨 Qwen-Image-Edit Style Mimic LoRA — v0.1 (Alpha)
35
+
36
+ An experimental LoRA fine-tuned model based on [Qwen-Image-Edit-2511](https:&#x2F;&#x2F;huggingface.co&#x2F;Qwen&#x2F;Qwen-Image-Edit-2511), designed for style transfer and artistic transformation. Trained with over 400 image pairs for 22,000 steps at rank 32, using a learning rate of 0.0001 and training resolution of 1024, achieving approximately 15 seconds per iteration on an NVIDIA RTX 4090.
37
+ 🔍 工作原理
38
+
39
+ 给定:
40
+ 一张源图像(例如,一个人的照片)
41
+ 一张参考图像(例如,卡通、线稿、插画等)
42
+
43
+ ![微信图片_20251230170631_361_150](https:&#x2F;&#x2F;cdn-uploads.huggingface.co&#x2F;production&#x2F;uploads&#x2F;671e48a732f6aa242c8c5de8&#x2F;2F2WX8_Gsfe7dJQKj4Quv.png)
44
+
45
+ 模型将参考图像的视觉风格应用于源图像,同时保留其结构和构图。虽然它仍处于早期阶段,但在ComfyUI中已经能够对部分风格产生良好的效果。
46
+ ✅ 示例:将cosplay照片转化为Lacoste鳄鱼素描风格、色彩斑斓的波普艺术狗或极简主义冬季卡通——只需一键。
47
+ 🔍 How It Works
48
+
49
+ Given:
50
+ A source image (e.g., a photo of a person)
51
+ A reference image (e.g., cartoon, line art, illustration, etc.)
52
+
53
+ The model applies the visual style of the reference image to the source image while preserving its structure and composition. Although still in its early stages, it has shown promising results for certain styles in ComfyUI.
54
+ ✅ Example: Turn a cosplay photo into a Lacoste-style crocodile sketch, a colorful pop-art dog, or a minimalist winter cartoon — all with one click.
55
+ 🛠️ 使用指南
56
+ 在 ComfyUI 中:
57
+ 1. 加载你的源图像(例如,一张照片)。
58
+ 2. 加载一个风格参考图像。
59
+ 3. 应用此 LoRA,强度设置为0.6–1.0。
60
+ 4. 使用图像到图像或修复节点生成结果。
61
+
62
+ 尽管该模型在某些风格上表现良好,但它仍在开发中,可能会遇到一些局限性。
63
+ In SD WebUI:
64
+ Load the LoRA via &quot;Load LoRA&quot; tab.
65
+ Set LoRA weight to 0.7–1.0.
66
+ Use with img2img mode and a reference image as input.
67
+ 💡 Tip: For best results, use references with similar aspect ratios and compositions.
68
+ 🧪 训练详情
69
+ 基础模型: Qwen-Image-Edit-2511
70
+ 训练方法: LoRA 微调(Rank&#x3D;32, Alpha&#x3D;16)
71
+ 数据集: 自定义精选的艺术风格数据集(线稿、卡通、波普艺术、超现实主义等),包含超过400组图像
72
+ Epochs: 22000 步
73
+ Batch Size: 1
74
+ 学习率: 1e-4
75
+ 优化器: AdamW
76
+ 训练硬件: NVIDIA RTX 4090,大约每迭代15秒
77
+ ⚠️ 局限性与未来工作
78
+
79
+ 尽管前景看好,但该模型仍处于早期发展阶段。当前的局限性包括:
80
+ 转换过程中面部特征可能会有些模糊
81
+ 色彩一致性在不同风格间可能有所不同
82
+ 复杂纹理可能无法完全转移
83
+
84
+ 📌 未来改进:
85
+ 使用遮罩增强面部保护
86
+ 添加色彩校正损失
87
+ 支持更多样化的参考风格
88
+ 在更高分辨率的图像上进行训练
89
+ 🔄 版本控制
90
+ v0.1(Alpha): 初始发布——实验性但功能正常
91
+ v0.2+: 预计很快推出,具有改进的稳定性和准确性
92
+ 📂 License
93
+
94
+ 该模型根据MIT许可证发布。您可以在任何目的下自由使用、修改和分发它,包括商业应用,只需注明原作者即可。
95
+ 📣 反馈与贡献
96
+
97
+ 我正在积极改进这个模型!如果您有任何建议、发现错误或想要提供示例,请打开一个问题或留下评论。
98
+
99
+ 让我们一起让风格转换更加强大吧!🌟
100
+ 📝 Created by @zooeyy Still training... stay tuned!
101
+
102
+ ## Trigger words
103
+
104
+ You should use `图1变为图2风格` to trigger the image generation.
105
+
106
+
107
+ ## Download model
108
+
109
+
110
+ [Download](/zooeyy/Style-Transfer/tree/main) them in the Files & versions tab.