|
|
--- |
|
|
license: mit |
|
|
datasets: |
|
|
- SubMaroon/danbooru-lineart |
|
|
base_model: |
|
|
- cagliostrolab/animagine-xl-3.0 |
|
|
--- |
|
|
|
|
|
# Experimental ControlNet (Low Quality / Research Prototype) |
|
|
|
|
|
> **Experimental model. Low quality. Not intended for production use.** |
|
|
> This ControlNet was trained as a research experiment to explore line-based conditioning and colorization behavior in SDXL anime models. |
|
|
|
|
|
--- |
|
|
|
|
|
## Model Summary |
|
|
|
|
|
This repository contains an **experimental ControlNet for SDXL**, trained on anime-style images. |
|
|
The model is **not stable**, shows **inconsistent color behavior**, and should be treated as a **research prototype** rather than a finished or polished solution. |
|
|
|
|
|
The goal of this experiment was to understand: |
|
|
- How SDXL ControlNet learns **colorization from line-based conditioning** |
|
|
- How different conditioning types (Canny vs Lineart) affect **color consistency** |
|
|
|
|
|
--- |
|
|
|
|
|
## Base Model |
|
|
|
|
|
- **Base model:** `cagliostrolab/animagine-xl-3.0` |
|
|
- **Architecture:** ControlNet SDXL |
|
|
- **Training framework:** 🤗 Diffusers |
|
|
- **Precision:** `bf16` |
|
|
|
|
|
--- |
|
|
|
|
|
## Conditioning Type |
|
|
|
|
|
- Primary conditioning: **Lineart / Canny-like edges** |
|
|
- Backgrounds are mostly white |
|
|
- Line quality varies (mostly clean, some noisy samples) |
|
|
|
|
|
> Important limitation: |
|
|
> Lineart / Canny **does not contain color information**, which leads to unstable and drifting color predictions. |
|
|
|
|
|
--- |
|
|
|
|
|
## Dataset |
|
|
|
|
|
- Size: ~**14,000 image pairs** |
|
|
- Format: |
|
|
- Original image (color) |
|
|
- Conditioning image (lineart / canny) |
|
|
- Prompt (caption) |
|
|
|
|
|
### Known dataset issues |
|
|
- Some lineart images are **noisy or inconsistent** |
|
|
- Images are resized to square resolution (possible cropping artifacts) |
|
|
- No explicit color supervision |
|
|
- No palette or region-level color constraints |
|
|
|
|
|
--- |
|
|
|
|
|
## Training Configuration |
|
|
|
|
|
Typical training setup: |
|
|
|
|
|
```bash |
|
|
resolution: 768 |
|
|
train_batch_size: 2 |
|
|
gradient_accumulation_steps: 2 |
|
|
effective_batch_size: 4 |
|
|
learning_rate: 2e-5 |
|
|
lr_scheduler: cosine |
|
|
max_train_steps: 6000–8000 |
|
|
mixed_precision: bf16 |
|
|
|