manual-gen-images / README.md
OrianaRueckert's picture
Upload folder using huggingface_hub
cd1be53 verified
# Manual Prompt-Aligned AI Image Dataset
## Description
This dataset contains manually generated AI images created from a fixed, predefined set of prompts. Each image is directly aligned to its corresponding prompt file and generated across multiple image generation models.
## Prompt Alignment
Each image is generated directly from a corresponding prompt file found in ./prompts.
## Directory Structure
```
images/
β”œβ”€β”€ dalle/
β”‚ β”œβ”€β”€ p001.png
β”‚ β”œβ”€β”€ p002.png
β”‚ └── ...
β”œβ”€β”€ midjourney/
β”‚ β”œβ”€β”€ p001.png
β”‚ └── ...
β”œβ”€β”€ bing/
β”‚ β”œβ”€β”€ p001.jpg
β”‚ └── ...
β”œβ”€β”€ stable_diffusion/
β”‚ β”œβ”€β”€ p001.png
β”‚ └── ...
β”œβ”€β”€ ideogram/
β”‚ β”œβ”€β”€ p001.png
β”‚ └── ...
└── nanobanana/
β”œβ”€β”€ p001.png
└── ...
prompts/
β”œβ”€β”€ p001.txt
└── ...
```
## Generators Used
- DALL-E
- Midjourney (In Progress)
- Bing (MAI-Image-1 Model)
- Stable Diffusion (In Progress)
- Ideogram (In Progress)
- FLUX (skipped due to required paid credits)
- Nano Banana
## Metadata Inference
Metadata is derived implicitly from file paths:
- `generator` β†’ parent directory name under `images/`
- `prompt_id` β†’ filename (e.g., `p042`)
- `prompt_text` β†’ contents of `prompts/p042.txt`
Example derived record:
```python
{
"image_path": ".../images/dalle/p042.png", # note that all images in bing folder are jpg
"generator": "dalle",
"prompt_id": "p042",
"prompt_text": "...contents of prompts/p042.txt..."
}
```
## Limitations
This dataset was manually generated and is therefore limited in scale and generator coverage.