Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,34 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Color Diffusion (Evaluating Model Perception of Color Illusions in Photorealistic Scenes)
|
| 2 |
+
|
| 3 |
+
Authors: Lingjun Mao, Zineng Tang, Alane Suhr
|
| 4 |
+
|
| 5 |
+
---
|
| 6 |
+
|
| 7 |
+

|
| 8 |
+
|
| 9 |
+
|
| 10 |
+
## Model Overview
|
| 11 |
+
|
| 12 |
+
The **Color Diffusion** model used in the paper "Evaluating Model Perception of Color Illusions in Photorealistic Scenes" is designed to generate images for RCID dataset based on a color sketch. By simply providing the model with a colored draft image, it can generate realistic images that match both the shape and color patterns of the provided sketch, according to a given text prompt. This model is built upon ControlNet and has been trained for 20 epochs on the MS COCO 2017 dataset.
|
| 13 |
+
|
| 14 |
+
## RCID Dataset
|
| 15 |
+
|
| 16 |
+

|
| 17 |
+
|
| 18 |
+
The construction of our dataset involves three steps:
|
| 19 |
+
|
| 20 |
+
1. **Image Generation.** For contrast and stripe illusions, we use procedural code to generate simple illusion images, which are then processed by our **Color Diffusion** model to create realistic illusion images. For filter illusions, we directly apply contrasting color filters to the original images. Each type of illusion also includes a corresponding control group without any illusions for comparison.
|
| 21 |
+
|
| 22 |
+
2. **Question Generation.** We use GPT-4o to generate image-specific questions that are designed to evaluate the model's understanding of the illusion.
|
| 23 |
+
|
| 24 |
+
3. **Human Feedback.** We collect human participants' feedback on these images and adjust the original classification of “illusion” and “non-illusion” based on whether participants are deceived.
|
| 25 |
+
|
| 26 |
+
Our data can be found in the following link: [RCID Dataset](https://huggingface.co/datasets/mao1207/RCID)
|
| 27 |
+
|
| 28 |
+
The code is released on [https://github.com/mao1207/RCID](https://huggingface.co/datasets/mao1207/RCID)
|
| 29 |
+
|
| 30 |
+
## License
|
| 31 |
+
|
| 32 |
+
The source code of this repository is released under the Apache License 2.0. The model license and dataset license are listed on their corresponding webpages.
|
| 33 |
+
|
| 34 |
+
For more information, access to the dataset, and to contribute, please visit our [Website](https://color-illusion.github.io/Color-Illusion/).
|