Realtime-FLUX

Paused

App Files Files Community

Realtime-FLUX / README.md

ginipick

Update README.md

644e663 verified 6 months ago

preview code

raw

history blame contribute delete

5.2 kB

	---
	title: Realtime FLUX Image
	emoji: 💬⚡
	colorFrom: yellow
	colorTo: pink
	sdk: gradio
	sdk_version: 5.35.0
	app_file: app.py
	pinned: true
	license: mit
	short_description: mcp_server & High quality Images in Realtime
	---
	Looking at this code, it's a Gradio-based application for real-time image generation using the FLUX.1-schnell model. Here's a detailed explanation:

	## English Explanation

	### Overview
	This application provides a real-time image generation interface using the FLUX.1-schnell diffusion model. It features instant preview capabilities where images are generated as you type, making it highly interactive and user-friendly.

	### Key Features

	1. Real-time Generation
	- Images are generated automatically as you type in the prompt
	- Uses GPU acceleration with `@spaces.GPU` decorator
	- Optimized for fast inference with only 1-4 steps

	2. User Interface Components
	- Prompt Input: Text area for describing desired images
	- Generated Image: Real-time display of generated results
	- Enhance Button: Manual trigger for image generation
	- Latency Display: Shows processing time for each generation

	3. Advanced Options
	- Seed Control: For reproducible results (0 to 2³²-1)
	- Randomize Seed: Toggle for random seed generation
	- Width/Height Sliders: Image dimensions (256-2048 pixels)
	- Inference Steps: Control generation quality/speed (1-4 steps)

	4. Special Features
	- Snow Effect: Animated snowflakes falling across the interface
	- Korean Text Detection: Warns when Korean text is detected in prompts
	- Example Gallery: Pre-defined creative prompts for inspiration
	- Automatic CUDA Cache Clearing: Prevents memory overflow

	### Technical Implementation

	1. Model Configuration
	- Uses FLUX.1-schnell with float16 precision for efficiency
	- Custom pipeline with intermediate outputs capability
	- GPU duration limited to 15 seconds per generation

	2. Input Validation
	- Automatic size constraints (256-2048 pixels)
	- Seed validation and randomization
	- Error handling with graceful fallbacks

	3. Performance Optimizations
	- Automatic Mixed Precision (AMP) for faster computation
	- CUDA cache clearing after each generation
	- Minimal inference steps for real-time performance

	### Example Prompts Included
	- Steampunk owl in Victorian clothing
	- Floating island made of books
	- Bioluminescent cyberpunk forest
	- Ancient temple with robot archaeologists
	- Cosmic coffee shop with constellation baristas

	---

	## 한글 설명

	### 개요
	이 애플리케이션은 FLUX.1-schnell 확산 모델을 사용한 실시간 이미지 생성 인터페이스입니다. 타이핑하는 동안 즉시 이미지가 생성되는 기능을 제공하여 매우 상호작용적이고 사용자 친화적입니다.

	### 주요 기능

	1. 실시간 생성
	- 프롬프트를 입력하는 동안 자동으로 이미지 생성
	- `@spaces.GPU` 데코레이터를 통한 GPU 가속
	- 1-4 단계만으로 빠른 추론 최적화

	2. 사용자 인터페이스 구성요소
	- 프롬프트 입력: 원하는 이미지를 설명하는 텍스트 영역
	- 생성된 이미지: 생성 결과의 실시간 표시
	- 향상 버튼: 수동 이미지 생성 트리거
	- 지연 시간 표시: 각 생성의 처리 시간 표시

	3. 고급 옵션
	- 시드 제어: 재현 가능한 결과를 위한 설정 (0 ~ 2³²-1)
	- 시드 무작위화: 무작위 시드 생성 토글
	- 너비/높이 슬라이더: 이미지 크기 (256-2048 픽셀)
	- 추론 단계: 생성 품질/속도 제어 (1-4 단계)

	4. 특별 기능
	- 눈 효과: 인터페이스 전체에 떨어지는 애니메이션 눈송이
	- 한글 텍스트 감지: 프롬프트에 한글이 감지되면 경고 표시
	- 예제 갤러리: 영감을 위한 사전 정의된 창의적 프롬프트
	- 자동 CUDA 캐시 정리: 메모리 오버플로 방지

	### 기술적 구현

	1. 모델 구성
	- 효율성을 위한 float16 정밀도의 FLUX.1-schnell 사용
	- 중간 출력 기능이 있는 커스텀 파이프라인
	- 생성당 GPU 시간을 15초로 제한

	2. 입력 검증
	- 자동 크기 제약 (256-2048 픽셀)
	- 시드 검증 및 무작위화
	- 우아한 폴백을 통한 오류 처리

	3. 성능 최적화
	- 빠른 계산을 위한 자동 혼합 정밀도(AMP)
	- 각 생성 후 CUDA 캐시 정리
	- 실시간 성능을 위한 최소 추론 단계

	### 포함된 예제 프롬프트
	- 빅토리아 시대 의상을 입은 스팀펑크 올빼미
	- 책으로 만들어진 떠다니는 섬
	- 생물발광 사이버펑크 숲
	- 로봇 고고학자가 있는 고대 사원
	- 별자리 바리스타가 있는 우주 커피숍

	### 사용 팁
	- 한글 프롬프트는 지원되지만 영어 프롬프트가 더 나은 결과를 생성합니다
	- 빠른 미리보기를 위해 추론 단계를 낮게 유지하세요
	- 고품질 이미지를 위해서는 "향상" 버튼을 클릭하세요
	- 시드 값을 고정하면 동일한 이미지를 재생성할 수 있습니다