Z-Image-Japan-Expert / README.md

baptle

metadata

library_name: peft
base_model: Tongyi-MAI/Z-Image-Turbo
tags:
  - lora
  - diffusion
  - image-generation
  - japan
  - photography
  - realistic
license: other
datasets:
  - ThePioneer/japanese-photos
language:
  - en
  - fr
pipeline_tag: text-to-image

Japan Realistic LoRA for Z-Image-Turbo

A LoRA adapter trained on realistic Japanese photography to enhance Z-Image-Turbo's ability to generate authentic Japanese scenes, urban landscapes, and cultural elements.

Model Description

This is a LoRA (Low-Rank Adaptation) adapter trained on top of the Tongyi-MAI/Z-Image-Turbo diffusion model. It specializes in generating realistic photographs of Japanese locations, transportation, architecture, and everyday scenes, with lighting and composition modeled on real photography.

Training Details

Base Model: Tongyi-MAI/Z-Image-Turbo
Training Steps: 2,000
LoRA Rank (r): 32
LoRA Alpha: 32
Learning Rate: 0.0001
Optimizer: AdamW 8-bit
Batch Size: 1 (with gradient accumulation of 4)
Training Resolution: 512x512
Precision: bfloat16
Noise Scheduler: FlowMatch
Trained Using: Ostris AI-Toolkit
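
As a quick sanity check on the hyperparameters listed above, the sketch below derives two quantities from them: the LoRA scaling factor and the effective batch size. It assumes the conventional LoRA scaling of alpha / r; whether the training toolkit applies exactly this scaling is not stated in this card, so treat the result as illustrative.

```python
# Values copied from the Training Details list above.
lora_rank = 32
lora_alpha = 32
batch_size = 1
grad_accum_steps = 4

# Conventional LoRA scaling (an assumption here): the low-rank update
# is multiplied by alpha / r before being added to the base weights.
lora_scale = lora_alpha / lora_rank

# Number of samples that contribute to each optimizer update.
effective_batch_size = batch_size * grad_accum_steps

print(f"LoRA scale: {lora_scale}")
print(f"Effective batch size: {effective_batch_size}")
```

With rank and alpha both set to 32, the scale works out to 1.0, so under this assumption the adapter's update is applied at full strength with no extra rescaling.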

Usage

Using with Diffusers

from diffusers import DiffusionPipeline
import torch

# Load base model
pipe = DiffusionPipeline.from_pretrained(
    "Tongyi-MAI/Z-Image-Turbo",
    torch_dtype=torch.bfloat16
)
pipe.to("cuda")

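# Load LoRA adapter (the repository id below is a placeholder;
# replace it with the id under which this adapter is published)
pipe.load_lora_weights("your-username/japan_realistic")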

# Generate image
prompt = "Photo of a Shinkansen bullet train stopped at a Japanese station platform, overhead roof structure, yellow tactile paving, natural daylight, ultra realistic."
image = pipe(
    prompt=prompt,
    num_inference_steps=8,
    guidance_scale=1.0,
    width=1024,
    height=1024
).images[0]  # .images is a list; take the first generated image

image.save("output.png")
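
To render several prompts in one go, a small helper along the following lines may be convenient. This is a minimal sketch: `generate_batch` and the `japan_NNN.png` file naming are hypothetical and not part of this repository or of Diffusers. It only assumes that `pipe` behaves like the pipeline loaded above, meaning that calling it returns an object whose `.images` attribute is a list of images with a `save` method.

```python
from pathlib import Path


def generate_batch(pipe, prompts, out_dir="outputs", **pipe_kwargs):
    """Run `pipe` once per prompt and save the first image of each run.

    Hypothetical helper: `pipe` is assumed to return an object with an
    `.images` list, as the pipeline in the usage example does.
    """
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)

    saved = []
    for index, prompt in enumerate(prompts):
        # Extra keyword arguments (steps, guidance, size) are passed through.
        result = pipe(prompt=prompt, **pipe_kwargs)
        target = out_path / f"japan_{index:03d}.png"
        result.images[0].save(target)
        saved.append(target)
    return saved
```

It could be called as `generate_batch(pipe, prompts, num_inference_steps=8, guidance_scale=1.0, width=1024, height=1024)` to reuse the settings from the example above; the function returns the list of saved paths.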