Instructions to use gotha1312/azis1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use gotha1312/azis1 with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("Qwen/Qwen-Image", dtype=torch.bfloat16, device_map="cuda") pipe.load_lora_weights("gotha1312/azis1") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- Draw Things
- DiffusionBee
| license: apache-2.0 | |
| base_model: Qwen/Qwen-Image | |
| library_name: diffusers | |
| pipeline_tag: text-to-image | |
| tags: | |
| - lora | |
| - qwen-image | |
| - text-to-image | |
| - diffusers | |
| - safetensors | |
| inference: false | |
| # Azis1 LoRA for Qwen-Image | |
| A LoRA (Low-Rank Adaptation) model trained on the Qwen-Image base model for generating images of a specific person. | |
| ## Model Details | |
| | Property | Value | | |
| |----------|-------| | |
| | **Base Model** | [Qwen/Qwen-Image](https://huggingface.co/Qwen/Qwen-Image) | | |
| | **LoRA Rank** | 64 | | |
| | **LoRA Alpha** | 64 | | |
| | **Training Resolution** | 512x512 | | |
| | **Training Epochs** | 5 | | |
| | **Training Images** | ~2400 | | |
| | **Trigger Word** | `azis1` | | |
| ## Usage | |
| ### Trigger Word | |
| Always include `azis1` at the beginning of your prompt for best results. | |
| ### With Diffusers (Python) | |
| ```python | |
| import torch | |
| from diffusers import DiffusionPipeline | |
| # Load base model | |
| pipe = DiffusionPipeline.from_pretrained( | |
| "Qwen/Qwen-Image", | |
| torch_dtype=torch.bfloat16, | |
| ) | |
| pipe.enable_model_cpu_offload() | |
| # Load LoRA | |
| pipe.load_lora_weights("YOUR_USERNAME/azis1-qwen-lora") | |
| # Generate | |
| image = pipe( | |
| prompt="azis1, portrait of a man, professional photography, studio lighting", | |
| num_inference_steps=50, | |
| guidance_scale=4.0, | |
| height=1024, | |
| width=1024, | |
| ).images[0] | |
| image.save("output.png") | |
| ``` | |
| ### With ComfyUI | |
| 1. Download the `.safetensors` file to `ComfyUI/models/loras/` | |
| 2. Use this node setup: | |
| - **UNETLoader** → `qwen_image_fp8_e4m3fn.safetensors` | |
| - **LoraLoaderModelOnly** → `azis1.safetensors` (strength: 0.85) | |
| - **CLIPLoader** → `qwen_2.5_vl_7b_fp8_scaled.safetensors` | |
| - **VAELoader** → `qwen_image_vae.safetensors` | |
| 3. Recommended settings: | |
| - Steps: 50 | |
| - CFG Scale: 4.0 | |
| - Sampler: Euler | |
| ## Example Prompts | |
| ``` | |
| azis1, portrait of a man, professional photography, studio lighting, high quality, 4K | |
| azis1, man in casual clothes, outdoor, natural lighting | |
| azis1, close-up portrait, dramatic lighting, artistic | |
| azis1, man with beard, formal attire, corporate headshot | |
| ``` | |
| ## Training Details | |
| - **Framework**: OneTrainer | |
| - **Optimizer**: AdamW 8-bit | |
| - **Learning Rate**: 0.0001 (Cosine schedule) | |
| - **Batch Size**: 2 (with gradient accumulation of 2) | |
| - **Precision**: bfloat16 | |
| - **Hardware**: NVIDIA RTX 5090 (32GB VRAM) | |
| ## Limitations | |
| - This LoRA only works with **Qwen-Image** base model | |
| - It will NOT work with Stable Diffusion 1.5, SDXL, or Flux models | |
| - Best results at 512x512 to 1024x1024 resolution | |
| ## License | |
| Apache 2.0 - Same as the base Qwen-Image model. | |
| ## Acknowledgments | |
| - [Qwen-Image](https://huggingface.co/Qwen/Qwen-Image) by Alibaba | |
| - [OneTrainer](https://github.com/Nerogar/OneTrainer) for training | |