simpson-lora / README.md
Foxintohumanbeing's picture
Update README.md
e47dcb3
---
license: creativeml-openrail-m
base_model: Norod78/sd-simpsons-model
datasets:
- JerryMo/Modified-Caption-Train-Set
instance_prompt: The Simpsons
tags:
- stable-diffusion
- stable-diffusion-diffusers
- text-to-image
- diffusers
- lora
---
**Github Repo**
The detailed work description and code can be found in https://github.com/foxintohumanbeing/DDA4210_Group_project.
The creation of high-quality image content from text descriptions is a challenging yet highly desirable task in the field
of artificial intelligence. We focus on the Simpsons, a popular animated series. Based on pretrained SOTA model, we
investigate in obtaining high-quality dataset and efficient fine-tuning methods. We explore the options of manually
creating the dataset and using different fine-tuning techniques such as simple baseline, LoRA, and Dreambooth. Our
approach involves combining the advantages of each option to achieve better results.
We propose dataset collection method and fine-tuning model(Simspon Artistic Memory). Moreover, to better
illustrating our results, we create two APPs, one for generating images and one for annotating the images (you can find them in github link provided). By improving
data collection and fine-tuning techniques on Simpsons, we hope to push the boundaries of what is achievable in the
text-to-image synthesis domain and inspire further research in this area.
**Prompts Format**
"The Simpsons. a [closeup?] of a [emotional expression] [race] [X year old] [man / woman / etc.], with [hair and makeup style], wearing [clothing style] while [doing] near [nearby objects],[outside / inside] with [objects / color ] in the background,in [time period]."
**Contact**
For any questions, please contact me at 120090438@link.cuhk.edu.cn