|
|
--- |
|
|
library_name: peft |
|
|
base_model: Tongyi-MAI/Z-Image-Turbo |
|
|
tags: |
|
|
- lora |
|
|
- diffusion |
|
|
- image-generation |
|
|
- japan |
|
|
- photography |
|
|
- realistic |
|
|
license: other |
|
|
datasets: |
|
|
- ThePioneer/japanese-photos |
|
|
language: |
|
|
- en |
|
|
- fr |
|
|
pipeline_tag: text-to-image |
|
|
--- |
|
|
|
|
|
# Japan Realistic LoRA for Z-Image-Turbo |
|
|
|
|
|
A LoRA adapter trained on realistic Japanese photography to enhance Z-Image-Turbo's ability to generate authentic Japanese scenes, urban landscapes, and cultural elements. |
|
|
|
|
|
## Model Description |
|
|
|
|
|
This is a LoRA (Low-Rank Adaptation) adapter trained on the [Tongyi-MAI/Z-Image-Turbo](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo) diffusion model. It specializes in generating realistic photographs of Japanese locations, transportation, architecture, and everyday scenes with authentic lighting and composition. |
|
|
|
|
|
## Training Details |
|
|
|
|
|
- **Base Model**: Tongyi-MAI/Z-Image-Turbo |
|
|
- **Training Steps**: 2,000 |
|
|
- **LoRA Rank (r)**: 32 |
|
|
- **LoRA Alpha**: 32 |
|
|
- **Learning Rate**: 0.0001 |
|
|
- **Optimizer**: AdamW 8-bit |
|
|
- **Batch Size**: 1 (with gradient accumulation of 4) |
|
|
- **Training Resolution**: 512x512 |
|
|
- **Precision**: bfloat16 |
|
|
- **Noise Scheduler**: FlowMatch |
|
|
- **Trained Using**: [Ostris AI-Toolkit](https://github.com/ostris/ai-toolkit) |
|
|
|
|
|
## Usage |
|
|
|
|
|
### Using with Diffusers |
|
|
|
|
|
```python |
|
|
from diffusers import DiffusionPipeline |
|
|
import torch |
|
|
|
|
|
# Load base model |
|
|
pipe = DiffusionPipeline.from_pretrained( |
|
|
"Tongyi-MAI/Z-Image-Turbo", |
|
|
torch_dtype=torch.bfloat16 |
|
|
) |
|
|
pipe.to("cuda") |
|
|
|
|
|
# Load LoRA adapter |
|
|
pipe.load_lora_weights("your-username/japan_realistic") |
|
|
|
|
|
# Generate image |
|
|
prompt = "Photo of a Shinkansen bullet train stopped at a Japanese station platform, overhead roof structure, yellow tactile paving, natural daylight, ultra realistic." |
|
|
image = pipe( |
|
|
prompt=prompt, |
|
|
num_inference_steps=8, |
|
|
guidance_scale=1.0, |
|
|
width=1024, |
|
|
height=1024 |
|
|
).images |
|
|
|
|
|
image.save("output.png") |