Text-to-Image
stable-diffusion
sdxl
lora
BackGwa committed on
Commit ffad7bd · verified · 1 Parent(s): ecda5dd

Update README.md

Files changed (1)
  1. README.md +20 -20
README.md CHANGED
@@ -34,13 +34,15 @@ base_model: BackGwa/LUMIERE-Q
34
  instance_prompt: >-
35
  backgwa, solo, pale blue hair, cat ears, golden eyes, navy beret, white shirt,
36
  red ribbon, oversized sleeves, blue jacket
37
- license: creativeml-openrail-m
 
 
38
  ---
39
 
40
  # Overview
41
- This LoRA is an SDXL-based character LoRA created to reproduce the `BACKGWA` character.
42
 
43
- This project was conducted for research purposes to protect the copyright, creative style, training rights, and intellectual property held by the original artist of this character.
44
  Instead of using images directly created by the artist as training data, this LoRA was trained only on an AI-generated synthetic image dataset.
45
 
46
  Through this approach, the project explores a way to build a character LoRA without directly using the original character images or the artist's work as training data.
@@ -55,13 +57,12 @@ Through this approach, the project explores a way to build a character LoRA with
55
  - Redistribution of this LoRA by third parties is prohibited in principle. However, it may be permitted if the original creator is clearly credited and proper attribution is provided.
56
 
57
  ## Trigger Word
58
- You can use the `BACKGWA` character by entering the trigger prompt below.
59
- This prompt includes the key visual features and default outfit elements required to generate the character.
60
 
61
- If you want to change specific details such as the outfit, expression, pose, background, or composition, you can keep the base trigger prompt and add or adjust the desired elements as needed.
 
62
 
63
  ```text
64
- backgwa, solo, pale blue hair, cat ears, golden eyes, navy beret, white shirt, red ribbon, oversized sleeves, blue jacket
65
  ```
66
 
67
  ## Usage
@@ -75,28 +76,27 @@ For the most stable and consistent results, it is recommended to use it with the
75
  ---
76
 
77
  # Training
 
78
  This LoRA was trained without using any human-made artwork as training data.
79
 
80
- The dataset consists of 28 synthetic images generated with GPT Image 2.
81
- These images were created specifically for this LoRA training process and were not collected from, derived from, or reused from artwork produced by human artists.
82
 
83
- To prepare the initial image-generation prompts, the original character image was provided as an image input to the locally executed [`gemma-4-E4B-it`](https://huggingface.co/google/gemma-4-E4B-it) model.
84
- Predefined prompts and instructions were used to guide the model in producing text prompts capable of reconstructing the character from the provided reference image.
85
 
86
- Using those reconstructed prompts, synthetic character images were generated with GPT Image 2.
87
- When suitable high-quality outputs were obtained, some of them were also used as additional references for further image generation.
88
 
89
- The generated images were then labeled using GPT-5.5.
90
- These labels describe the visible character features, outfit elements, composition, and other relevant image details used during training.
91
 
92
- A total of 28 synthetic images were used for training.
93
- The LoRA was trained on the SDXL-based [`LUMIERE-Q`](https://huggingface.co/BackGwa/LUMIERE-Q) model, with 10 repeats and 10 epochs at a resolution of `1024x1024`.
94
 
95
  |Parameter|Setting|
96
  |---|---|
97
  |Base Model|[`LUMIERE-Q`](https://huggingface.co/BackGwa/LUMIERE-Q)|
98
- |Model Type|`SDXL-based`|
99
- |Dataset Size|`28 images`|
100
  |Epochs|`10`|
101
  |Repeats|`10`|
102
- |Resolution|`1024x1024`|
 
34
  instance_prompt: >-
35
  backgwa, solo, pale blue hair, cat ears, golden eyes, navy beret, white shirt,
36
  red ribbon, oversized sleeves, blue jacket
37
+ license:
38
+ - creativeml-openrail-m
39
+ - mit
40
  ---
41
 
42
  # Overview
43
+ Character-LoRA is an SDXL-based character LoRA created to reproduce the `BACKGWA` character. This LoRA was trained on a copyright-conscious synthetic image dataset.
44
 
45
+ This project was conducted for research purposes to protect the copyright, creative style, training rights, and intellectual property held by the original artist of this character.
46
  Instead of using images directly created by the artist as training data, this LoRA was trained only on an AI-generated synthetic image dataset.
47
 
48
  Through this approach, the project explores a way to build a character LoRA without directly using the original character images or the artist's work as training data.
 
57
  - Redistribution of this LoRA by third parties is prohibited in principle. However, it may be permitted if the original creator is clearly credited and proper attribution is provided.
58
 
59
  ## Trigger Word
 
 
60
 
61
+ Use the trigger word below to activate the `BACKGWA` character.
62
+ You can add extra prompts for outfit, expression, pose, background, or composition as needed.
63
 
64
  ```text
65
+ backgwa
66
  ```
67
 
68
  ## Usage
 
76
  ---
77
 
78
  # Training
79
+
80
  This LoRA was trained without using any human-made artwork as training data.
81
 
82
+ The dataset consists of 48 synthetic images generated with GPT Image 2.
83
+ These images were created specifically for this LoRA training process, and were not collected from, derived from, or reused from artwork created by human artists.
84
 
85
+ To prepare the initial image-generation prompts, the original character image was provided as a reference image to the locally executed [`gemma-4-E4B-it`](https://huggingface.co/google/gemma-4-E4B-it) model.
86
+ Predefined prompts and instructions were used to guide the model in generating text prompts capable of reconstructing the character from the provided reference image.
87
 
88
+ Using these reconstructed prompts, synthetic character images were generated with GPT Image 2.
 
89
 
90
+ The generated images were then separately labeled using GPT-5.5.
91
+ These labels describe the visible visual features, outfit elements, composition, and other relevant details used during training.
92
 
93
+ A total of 48 synthetic images were used for training.
94
+ The LoRA was trained on the SDXL-based [`LUMIERE-Q`](https://huggingface.co/BackGwa/LUMIERE-Q) model at a resolution of `1024x1024`, with 10 repeats and 10 epochs.
95
 
96
  |Parameter|Setting|
97
  |---|---|
98
  |Base Model|[`LUMIERE-Q`](https://huggingface.co/BackGwa/LUMIERE-Q)|
99
+ |Dataset Size|`48 images`|
 
100
  |Epochs|`10`|
101
  |Repeats|`10`|
102
+ |Resolution|`1024x1024`|
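
The dataset-size change in this commit (28 to 48 images, with repeats and epochs unchanged) can be put in terms of training exposure. The sketch below is only illustrative: it assumes the common LoRA-trainer convention in which every image is shown `repeats` times per epoch, and it treats the batch size as a free parameter, because the README names neither the trainer nor the batch size. The function name `training_exposure` is hypothetical.

```python
import math


def training_exposure(num_images, repeats, epochs, batch_size=1):
    """Estimate how many samples and optimizer steps a run covers.

    Assumption: each image is seen `repeats` times per epoch, and the
    last partial batch of an epoch still counts as one step.
    """
    samples_per_epoch = num_images * repeats
    steps_per_epoch = math.ceil(samples_per_epoch / batch_size)
    return {
        "samples_per_epoch": samples_per_epoch,
        "total_samples": samples_per_epoch * epochs,
        "total_steps": steps_per_epoch * epochs,
    }


# Values taken from the old and new sides of the diff.
before = training_exposure(num_images=28, repeats=10, epochs=10)
after = training_exposure(num_images=48, repeats=10, epochs=10)

print("before:", before)
print("after: ", after)
```

Under these assumptions, the run goes from 2,800 to 4,800 image presentations in total. The step count equals the sample count only at a batch size of 1; a larger batch size (not stated in the README) reduces the step count proportionally.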