Commit ·
558dcc6
1
Parent(s): 7c7a424
Update README.md
Browse files
README.md
CHANGED
|
@@ -11,5 +11,21 @@ tags:
|
|
| 11 |
- diffusers
|
| 12 |
- lora
|
| 13 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 14 |
|
| 15 |
-
114514
|
|
|
|
| 11 |
- diffusers
|
| 12 |
- lora
|
| 13 |
---
|
| 14 |
+
**Github Repo**
|
| 15 |
+
The detailed work description and code can be found in https://github.com/foxintohumanbeing/DDA4210_Group_project.
|
| 16 |
+
|
| 17 |
+
The creation of high-quality image content from text descriptions is a challenging yet highly desirable task in the field
|
| 18 |
+
of artificial intelligence. We focus on the Simpsons, a popular animated series. Based on pretrained SOTA model, we
|
| 19 |
+
investigate in obtaining high-quality dataset and efficient fine-tuning methods. We explore the options of manually
|
| 20 |
+
creating the dataset and using different fine-tuning techniques such as simple baseline, LoRA, and Dreambooth. Our
|
| 21 |
+
approach involves combining the advantages of each option to achieve better results.
|
| 22 |
+
|
| 23 |
+
We propose dataset collection method and fine-tuning model(Simspon Artistic Memory). Moreover, to better
|
| 24 |
+
illustrating our results, we create two APPs, one for generating images and one for annotating the images (you can find them in github link provided). By improving
|
| 25 |
+
data collection and fine-tuning techniques on Simpsons, we hope to push the boundaries of what is achievable in the
|
| 26 |
+
text-to-image synthesis domain and inspire further research in this area.
|
| 27 |
+
|
| 28 |
+
**Prompts Format**
|
| 29 |
+
"The Simpsons. a [closeup?] of a [emotional expression] [race] [X year old] [man / woman / etc.], with [hair and makeup style], wearing [clothing style] while [doing] near [nearby objects],[outside / inside] with [objects / color ] in the background,in [time period]."
|
| 30 |
+
|
| 31 |
|
|
|