---
library_name: peft
base_model: Tongyi-MAI/Z-Image-Turbo
tags:
- lora
- diffusion
- image-generation
- japan
- photography
- realistic
license: other
datasets:
- ThePioneer/japanese-photos
language:
- en
- fr
pipeline_tag: text-to-image
---

# Japan Realistic LoRA for Z-Image-Turbo

A LoRA adapter trained on realistic Japanese photography to enhance Z-Image-Turbo's ability to generate authentic Japanese scenes, urban landscapes, and cultural elements.

## Model Description

This is a LoRA (Low-Rank Adaptation) adapter trained on the [Tongyi-MAI/Z-Image-Turbo](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo) diffusion model. It specializes in generating realistic photographs of Japanese locations, transportation, architecture, and everyday scenes with authentic lighting and composition.

## Training Details

- **Base Model**: Tongyi-MAI/Z-Image-Turbo
- **Training Steps**: 2,000
- **LoRA Rank (r)**: 32
- **LoRA Alpha**: 32
- **Learning Rate**: 0.0001
- **Optimizer**: AdamW 8-bit
- **Batch Size**: 1 (with gradient accumulation of 4)
- **Training Resolution**: 512x512
- **Precision**: bfloat16
- **Noise Scheduler**: FlowMatch
- **Trained Using**: [Ostris AI-Toolkit](https://github.com/ostris/ai-toolkit)

## Usage

### Using with Diffusers

```python
from diffusers import DiffusionPipeline
import torch

# Load base model
pipe = DiffusionPipeline.from_pretrained(
    "Tongyi-MAI/Z-Image-Turbo",
    torch_dtype=torch.bfloat16
)
pipe.to("cuda")

# Load LoRA adapter
pipe.load_lora_weights("your-username/japan_realistic")

# Generate image
prompt = "Photo of a Shinkansen bullet train stopped at a Japanese station platform, overhead roof structure, yellow tactile paving, natural daylight, ultra realistic."
image = pipe(
    prompt=prompt,
    num_inference_steps=8,
    guidance_scale=1.0,
    width=1024,
    height=1024
).images

image.save("output.png")