File size: 5,036 Bytes
33a6aa2 ee8ac22 1fdc411 33a6aa2 ee8ac22 33a6aa2 ee8ac22 774aabe ee8ac22 33a6aa2 ee8ac22 33a6aa2 774aabe ee8ac22 774aabe 33a6aa2 ee8ac22 774aabe ee8ac22 774aabe 33a6aa2 774aabe 33a6aa2 ee8ac22 33a6aa2 774aabe 33a6aa2 774aabe 06fd5c5 774aabe 33a6aa2 774aabe ee8ac22 33a6aa2 ee8ac22 33a6aa2 774aabe 33a6aa2 774aabe ee8ac22 774aabe ee8ac22 774aabe ee8ac22 774aabe ee8ac22 33a6aa2 ee8ac22 33a6aa2 774aabe 33a6aa2 ee8ac22 774aabe ee8ac22 774aabe ee8ac22 774aabe 06fd5c5 774aabe ee8ac22 774aabe ee8ac22 33a6aa2 ee8ac22 774aabe | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 | ---
license: other
library_name: comfyui
pipeline_tag: image-to-image
tags:
- stable-diffusion
- stable-diffusion-diffusers
- image-to-image
- lora
- comfyui-workflow
- education
- portfolio
- art
- onetrainer
base_model: stabilityai/stable-diffusion-xl-base-1.0
---
# 🎨 CS x Design Convergence Project: Generative AI Pipeline & Workflow Archive
> **"Bridging Technical Logic with Aesthetic Sensibility"**
>
> This repository serves as a **Portfolio Archive** documenting the construction of Generative AI image generation pipelines and workflow optimization.
> As a result of an interdisciplinary curriculum merging **Computer Science and Design**, this project demonstrates the end-to-end process from data collection and model fine-tuning to the design of advanced inference workflows.
---
## 📋 1. Project Overview
The core objective of this project is to demonstrate the ability to **accurately train specific artistic styles** and implement them into **highly controllable workflows**, going beyond simple prompt engineering. It aims to prove both technical proficiency (Model Architecture, Latent Space understanding) and artistic expression (Style Transfer).
* **Key Activities:** Custom LoRA Training, Advanced ComfyUI Workflow Design, Automated Pipeline Scripting.
* **Tools Used:** ComfyUI, OneTrainer, Stable Diffusion, Python, Hugging Face.
---
## 🧠 2. Model Training Methodology: Kirochy Style LoRA
To replicate the unique style of the illustrator **Kirochy**, I conducted LoRA (Low-Rank Adaptation) training with a rigorous data processing approach.
### 2.1 Data Acquisition & Preprocessing
* **Data Source:** Aggregated reference illustrations from the artist's official portfolios ([Instagram @kirochy_00](https://www.instagram.com/kirochy_00/), X).
* **Preprocessing:** Implemented **OneTrainer** to handle various resolutions and aspect ratios via bucketing. Conducted detailed tagging to capture specific stylistic features (line art weight, color palettes, shading techniques).
### 2.2 Training Framework & Optimization
* **Engine:** Trained using **OneTrainer** for precise parameter control.
* **Optimization:** Adjusted Epochs and Learning Rates iteratively to balance between style fidelity and generalization, ensuring the model avoids overfitting while retaining the artist's signature touch.
---
## ⚙️ 3. Workflow Architecture: P2A (Photo to Anime) Pipeline
The `p2a.ai.json` file in this repository is a highly sophisticated **Img2Img Workflow** designed to convert real-world photos into Kirochy-style illustrations. To solve common structural distortion issues in style transfer, I engineered a multi-stage processing pipeline.
### 3.1 Technical Logic & Customization
This workflow is not a mere copy-paste; it is a **custom-built architecture** integrating various advanced techniques researched from diverse community workflows and technical documentation.
1. **ControlNet Integration (Structural Integrity):**
* Utilized ControlNet algorithms to strictly preserve the pose and depth information of the source image, preventing the "hallucinations" often seen in generative models.
2. **SAM (Segment Anything Model) & SAG (Self-Attention Guidance):**
* Integrated **SAM** for precise object segmentation and **SAG** to refine attention mechanisms. This ensures a clear separation between the subject and the background, enhancing the clarity of the illustration style.
3. **Automatic Detailer (Face & Hand Refinement):**
* Implemented a post-processing pipeline using **Face and Hand Detailers**. The workflow automatically detects and masks these complex regions, resampling them at higher resolutions to fix artifacts and ensure anatomical correctness.
---
## 🖼️ 4. Results & Portfolio Showcase
The final outputs generated using this model and workflow are archived on Instagram. You can compare the reference inputs with the generated results to verify the technical quality.
* **Instagram Portfolio:** [@eom0am](https://www.instagram.com/eom0am)
---
## ⚠️ 5. Ethical Considerations & License
This project was conducted strictly for **Academic Study and Research purposes**.
### ⛔ Copyright & Usage Warning
* **Intellectual Property:** The copyright and stylistic rights of the LoRA model belong entirely to the original artist, **Kirochy** ([@kirochy_00](https://www.instagram.com/kirochy_00/)).
* **Non-Commercial Use Only:** Utilizing this model file or the workflows for **any commercial purpose (sales, paid commissions, advertising, etc.) is strictly prohibited.**
* **Legal Notice:** Any commercial exploitation may result in legal consequences under copyright laws.
### 📝 Scope of Permitted Use
* ⭕ **Allowed:** Personal study, portfolio research, non-commercial fan art.
* ❌ **Prohibited:** Commercial use, impersonation of the original artist, unauthorized redistribution for profit.
---
**Author:** Um Yunsang
**Role:** CS & Design Convergence Researcher / AI Engineer Candidate |