Upload Low-Res_Input_Img_Repair/README.md with huggingface_hub
Browse files
Low-Res_Input_Img_Repair/README.md
ADDED
|
@@ -0,0 +1,64 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
language:
|
| 3 |
+
- en
|
| 4 |
+
tags:
|
| 5 |
+
- art
|
| 6 |
+
---
|
| 7 |
+
|
| 8 |
+
# Qwen-Image-Edit Lowres-Fix (Input Image Repair)
|
| 9 |
+
|
| 10 |
+
**Creator**: [Cyph3r](https://civitai.com/user/Cyph3r)
|
| 11 |
+
**Type**: LORA
|
| 12 |
+
**Base Model**: Qwen
|
| 13 |
+
**Version**: Low-Res Input Img Repair
|
| 14 |
+
**Trigger Words**: `N/A`
|
| 15 |
+
|
| 16 |
+
**Civitai Model ID**: 1889350
|
| 17 |
+
**Civitai Version ID**: 2138532
|
| 18 |
+
|
| 19 |
+
**Stats (at time of fetch for this version)**:
|
| 20 |
+
* Downloads: 530
|
| 21 |
+
* Rating: 0 (0 ratings)
|
| 22 |
+
* Favorites: N/A
|
| 23 |
+
|
| 24 |
+
---
|
| 25 |
+
|
| 26 |
+
## 📄 Description (Parent Model)
|
| 27 |
+
|
| 28 |
+
Qwen-Image-Edit Low-Resolution Input Repair LoRA
|
| 29 |
+
Model Introduction
|
| 30 |
+
Qwen-Image-Edit
|
| 31 |
+
is a powerful open-source image editing model. However, when the input resolution of the model is lower than the target resolution for image generation, the model's ability to maintain image details is poor. To address this, we made the following two modifications:
|
| 32 |
+
Rope Interpolation: The position encoding of the input image in Qwen-Image DiT is changed to an interpolated sampling of the position encoding at the target resolution. This modification can take effect independently of modification 2.
|
| 33 |
+
LoRA Fine-tuning: Quickly train a LoRA model to enhance the generalization of this interpolated encoding by DiT.
|
| 34 |
+
With these two modifications, the model can produce consistent edited images even when given low-resolution input. Additionally, compared to high-resolution input, the inference time of the model is significantly reduced.
|
| 35 |
+
Source:
|
| 36 |
+
https://modelscope.cn/models/DiffSynth-Studio/Qwen-Image-Edit-Lowres-Fix
|
| 37 |
+
|
| 38 |
+
## Version Notes (Low-Res Input Img Repair)
|
| 39 |
+
|
| 40 |
+
Qwen-Image-Edit Low-Resolution Input Repair LoRA
|
| 41 |
+
Model Introduction
|
| 42 |
+
Qwen-Image-Edit
|
| 43 |
+
is a powerful open-source image editing model. However, when the input resolution of the model is lower than the target resolution for image generation, the model's ability to maintain image details is poor. To address this, we made the following two modifications:
|
| 44 |
+
Rope Interpolation: The position encoding of the input image in Qwen-Image DiT is changed to an interpolated sampling of the position encoding at the target resolution. This modification can take effect independently of modification 2.
|
| 45 |
+
LoRA Fine-tuning: Quickly train a LoRA model to enhance the generalization of this interpolated encoding by DiT.
|
| 46 |
+
With these two modifications, the model can produce consistent edited images even when given low-resolution input. Additionally, compared to high-resolution input, the inference time of the model is significantly reduced.
|
| 47 |
+
https://modelscope.cn/models/DiffSynth-Studio/Qwen-Image-Edit-Lowres-Fix
|
| 48 |
+
|
| 49 |
+
---
|
| 50 |
+
|
| 51 |
+
## Civitai Links
|
| 52 |
+
|
| 53 |
+
* **[🔗 View This Version on Civitai →](https://civitai.com/models/1889350?modelVersionId=2138532)**
|
| 54 |
+
* [View Full Model Page →](https://civitai.com/models/1889350)
|
| 55 |
+
* [View Creator Profile →](https://civitai.com/user/Cyph3r)
|
| 56 |
+
|
| 57 |
+
---
|
| 58 |
+
|
| 59 |
+
## File Information
|
| 60 |
+
|
| 61 |
+
* **Filename**: `Qwen-Image-Edit-Lowres-Fix.safetensors`
|
| 62 |
+
* **Size**: 450.18 MB
|
| 63 |
+
* **Hash (AutoV2)**: `1D5E579C97`
|
| 64 |
+
* **Hash (SHA256)**: `1D5E579C975D4810CCA93BFE7EBE59C4ADFF23DDD8397B052D3F82F263FCDD76`
|