Tan
Update README.md
964b01c verified
---
license: other
license_name: flux-1-dev-non-commercial-license
license_link: https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md
language:
- en
tags:
- flux
- diffusers
- lora
- kontext
- web-design
- ui-ux
- design-transformation
- instruction-following
base_model: black-forest-labs/FLUX.1-dev
pipeline_tag: image-to-image
---
# InstructDesign-Kontext
Fine-tuned FLUX.1 Kontext [dev] model for transforming web interfaces with natural language instructions.
## Model Description
InstructDesign-Kontext is a LoRA fine-tuned version of FLUX.1 Kontext [dev] specialized in:
- Web interface transformations
- UI/UX redesigns
- Style transfers (dark mode, glassmorphism, etc.)
- Device mockup generation
- Design system applications
### Training Details
- **Base Model**: FLUX.1 Kontext [dev]
- **Training Method**: LoRA fine-tuning
- **Dataset**: 937 web interface transformation pairs
- **Training Steps**: 10,000
- **Training Time**: ~2 weeks (iterative)
## File Structure
```
├── unet/
│ └── flux1-kontext-dev.safetensors (23GB)
├── loras/
│ └── flux_kontext_lora_v4_consolidated_000010000.safetensors (2.6GB)
├── clip/
│ ├── clip_l.safetensors (235MB)
│ └── t5xxl_fp8_e4m3fn_scaled.safetensors (4.9GB)
├── vae/
│ └── ae.safetensors (320MB)
└── workflow/
└── workflow_api.json
```
## Usage with ComfyUI
### 1. Download Models
Download all model files to your ComfyUI models directory:
```bash
# Main model
ComfyUI/models/diffusion_models/flux1-kontext-dev.safetensors
# LoRA
ComfyUI/models/loras/flux_kontext_lora_v4_consolidated_000010000.safetensors
# CLIP encoders
ComfyUI/models/clip/clip_l.safetensors
ComfyUI/models/clip/t5xxl_fp8_e4m3fn_scaled.safetensors
# VAE
ComfyUI/models/vae/ae.safetensors
```
### 2. Load Workflow
Import the `workflow/workflow_api.json` into ComfyUI.
### 3. Configure Parameters
Default optimal settings:
- **Steps**: 20
- **CFG**: 1.0
- **Guidance**: 5.0
- **Sampler**: er_sde
- **Scheduler**: kl_optimal
- **LoRA Strength**: 1.0
## API Usage
See the [GitHub repository](https://github.com/tercumantanumut/instructdesign-kontext) for full API implementation.
```python
# Example API request
import requests
response = requests.post("http://localhost:8000/api/generate", json={
"positive_prompt": "Transform this interface to dark mode with purple accents",
"input_image": "website.png",
"seed": 42,
"steps": 20,
"cfg": 1.0,
"guidance": 5.0
})
```
## Transformation Capabilities
### Style Transformations
- Dark/light mode conversion
- Glassmorphism effects
- Neumorphic design
- Material Design application
- iOS/Android style transfer
### UI/UX Improvements
- Layout modernization
- Accessibility enhancements
- Color scheme updates
- Typography improvements
- Interactive element additions
### Mockup Generation
- Device frame placement
- Environment contextualization
- Billboard/poster mockups
- Multi-device presentations
## Example Prompts
```
"Transform this website to dark mode with purple gradient accents"
"Apply glassmorphism effect with blurred backgrounds"
"Convert to material design 3 with rounded corners"
"Place this interface on an iPhone 15 Pro in a coffee shop"
"Modernize this dated interface with current design trends"
```
## Performance
- **Inference Time**: ~45-70 seconds (NVIDIA GPU 16GB+)
- **VRAM Requirements**: 16GB minimum, 24GB recommended
- **Quality**: 85%+ instruction adherence rate
## Docker Deployment
Complete Docker setup available in the GitHub repository with:
- ComfyUI container
- FastAPI wrapper
- Queue management
- Auto-scaling workers
## Limitations
- Requires significant VRAM (16GB+)
- Best results with clear, specific instructions
- Limited to single image transformations (no batch in single inference)
## Citation
If you use this model, please cite:
```bibtex
@misc{instructdesign-kontext,
title={InstructDesign-Kontext: Web Interface Transformation with FLUX},
author={tercumantanumut},
year={2024},
publisher={Hugging Face}
}
```
## License
This model is subject to the FLUX.1 [dev] Non-Commercial License.
## Acknowledgments
- FLUX.1 team for the base model
- ComfyUI for the inference framework
- 10 weeks of iterative development