--- license: other library_name: comfyui pipeline_tag: image-to-image tags: - stable-diffusion - stable-diffusion-diffusers - image-to-image - lora - comfyui-workflow - education - portfolio - art - onetrainer base_model: stabilityai/stable-diffusion-xl-base-1.0 --- # 🎨 CS x Design Convergence Project: Generative AI Pipeline & Workflow Archive > **"Bridging Technical Logic with Aesthetic Sensibility"** > > This repository serves as a **Portfolio Archive** documenting the construction of Generative AI image generation pipelines and workflow optimization. > As a result of an interdisciplinary curriculum merging **Computer Science and Design**, this project demonstrates the end-to-end process from data collection and model fine-tuning to the design of advanced inference workflows. --- ## 📋 1. Project Overview The core objective of this project is to demonstrate the ability to **accurately train specific artistic styles** and implement them into **highly controllable workflows**, going beyond simple prompt engineering. It aims to prove both technical proficiency (Model Architecture, Latent Space understanding) and artistic expression (Style Transfer). * **Key Activities:** Custom LoRA Training, Advanced ComfyUI Workflow Design, Automated Pipeline Scripting. * **Tools Used:** ComfyUI, OneTrainer, Stable Diffusion, Python, Hugging Face. --- ## 🧠 2. Model Training Methodology: Kirochy Style LoRA To replicate the unique style of the illustrator **Kirochy**, I conducted LoRA (Low-Rank Adaptation) training with a rigorous data processing approach. ### 2.1 Data Acquisition & Preprocessing * **Data Source:** Aggregated reference illustrations from the artist's official portfolios ([Instagram @kirochy_00](https://www.instagram.com/kirochy_00/), X). * **Preprocessing:** Implemented **OneTrainer** to handle various resolutions and aspect ratios via bucketing. Conducted detailed tagging to capture specific stylistic features (line art weight, color palettes, shading techniques). ### 2.2 Training Framework & Optimization * **Engine:** Trained using **OneTrainer** for precise parameter control. * **Optimization:** Adjusted Epochs and Learning Rates iteratively to balance between style fidelity and generalization, ensuring the model avoids overfitting while retaining the artist's signature touch. --- ## ⚙️ 3. Workflow Architecture: P2A (Photo to Anime) Pipeline The `p2a.ai.json` file in this repository is a highly sophisticated **Img2Img Workflow** designed to convert real-world photos into Kirochy-style illustrations. To solve common structural distortion issues in style transfer, I engineered a multi-stage processing pipeline. ### 3.1 Technical Logic & Customization This workflow is not a mere copy-paste; it is a **custom-built architecture** integrating various advanced techniques researched from diverse community workflows and technical documentation. 1. **ControlNet Integration (Structural Integrity):** * Utilized ControlNet algorithms to strictly preserve the pose and depth information of the source image, preventing the "hallucinations" often seen in generative models. 2. **SAM (Segment Anything Model) & SAG (Self-Attention Guidance):** * Integrated **SAM** for precise object segmentation and **SAG** to refine attention mechanisms. This ensures a clear separation between the subject and the background, enhancing the clarity of the illustration style. 3. **Automatic Detailer (Face & Hand Refinement):** * Implemented a post-processing pipeline using **Face and Hand Detailers**. The workflow automatically detects and masks these complex regions, resampling them at higher resolutions to fix artifacts and ensure anatomical correctness. --- ## 🖼️ 4. Results & Portfolio Showcase The final outputs generated using this model and workflow are archived on Instagram. You can compare the reference inputs with the generated results to verify the technical quality. * **Instagram Portfolio:** [@eom0am](https://www.instagram.com/eom0am) --- ## ⚠️ 5. Ethical Considerations & License This project was conducted strictly for **Academic Study and Research purposes**. ### ⛔ Copyright & Usage Warning * **Intellectual Property:** The copyright and stylistic rights of the LoRA model belong entirely to the original artist, **Kirochy** ([@kirochy_00](https://www.instagram.com/kirochy_00/)). * **Non-Commercial Use Only:** Utilizing this model file or the workflows for **any commercial purpose (sales, paid commissions, advertising, etc.) is strictly prohibited.** * **Legal Notice:** Any commercial exploitation may result in legal consequences under copyright laws. ### 📝 Scope of Permitted Use * ⭕ **Allowed:** Personal study, portfolio research, non-commercial fan art. * ❌ **Prohibited:** Commercial use, impersonation of the original artist, unauthorized redistribution for profit. --- **Author:** Um Yunsang **Role:** CS & Design Convergence Researcher / AI Engineer Candidate