Update README.md
Browse files
README.md
CHANGED
|
@@ -5,7 +5,7 @@ license: apache-2.0
|
|
| 5 |
<div align="center">
|
| 6 |
Sa Xiao<sup>*</sup>, Yibo Lu<sup>*</sup>, Kangjian Wu<sup>*</sup>, Bin Wu<sup>†</sup>, Haoxiong Su, Mian Peng, Qiwen Mao, Wenjiang Zhou<br/>(*co-first author), (†Corresponding Author, benbinwu@tencent.com)<br/>
|
| 7 |
Lyra Lab, Tencent Music Entertainment<br/>
|
| 8 |
-
<p>[<a href="https://github.com/TMElyralab/lyraDiff">github</a>]
|
| 9 |
</div>
|
| 10 |
|
| 11 |
## Introduction
|
|
@@ -27,8 +27,85 @@ The core features include:
|
|
| 27 |
|
| 28 |
## Usage
|
| 29 |
|
|
|
|
|
|
|
| 30 |
We provide a reference implementation of lyraDiff version of SD1.5/SDXL, as well as sampling code, in a dedicated [github repository](https://github.com/TMElyralab/lyraDiff).
|
| 31 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 32 |
## Citation
|
| 33 |
``` bibtex
|
| 34 |
@Misc{lyraDiff_2025,
|
|
|
|
| 5 |
<div align="center">
|
| 6 |
Sa Xiao<sup>*</sup>, Yibo Lu<sup>*</sup>, Kangjian Wu<sup>*</sup>, Bin Wu<sup>†</sup>, Haoxiong Su, Mian Peng, Qiwen Mao, Wenjiang Zhou<br/>(*co-first author), (†Corresponding Author, benbinwu@tencent.com)<br/>
|
| 7 |
Lyra Lab, Tencent Music Entertainment<br/>
|
| 8 |
+
<p>[<a href="https://github.com/TMElyralab/lyraDiff">github</a>] </p>
|
| 9 |
</div>
|
| 10 |
|
| 11 |
## Introduction
|
|
|
|
| 27 |
|
| 28 |
## Usage
|
| 29 |
|
| 30 |
+
<!-- 
|
| 31 |
+
-->
|
| 32 |
We provide a reference implementation of lyraDiff version of SD1.5/SDXL, as well as sampling code, in a dedicated [github repository](https://github.com/TMElyralab/lyraDiff).
|
| 33 |
|
| 34 |
+
### Example
|
| 35 |
+
We provide a minimal [script](https://github.com/TMElyralab/lyraDiff/blob/main/examples/SDXL/ipadapter_demo.py) for running SDXL models + IP-Adapter with lyraDiff as follows:
|
| 36 |
+
|
| 37 |
+
```python
|
| 38 |
+
import torch
|
| 39 |
+
import time
|
| 40 |
+
import sys, os
|
| 41 |
+
from diffusers import StableDiffusionXLPipeline
|
| 42 |
+
from lyradiff.lyradiff_model.module.lyradiff_ip_adapter import LyraIPAdapter
|
| 43 |
+
from transformers import CLIPTextModel, CLIPTokenizer, CLIPTextModelWithProjection
|
| 44 |
+
from lyradiff.lyradiff_model.lyradiff_unet_model import LyraDiffUNet2DConditionModel
|
| 45 |
+
from lyradiff.lyradiff_model.lyradiff_vae_model import LyraDiffVaeModel
|
| 46 |
+
from diffusers import EulerAncestralDiscreteScheduler
|
| 47 |
+
from PIL import Image
|
| 48 |
+
from diffusers.utils import load_image
|
| 49 |
+
import GPUtil
|
| 50 |
+
|
| 51 |
+
model_path = "/path/to/sdxl/model/"
|
| 52 |
+
vae_model_path = "/path/to/sdxl/sdxl-vae-fp16-fix"
|
| 53 |
+
|
| 54 |
+
text_encoder = CLIPTextModel.from_pretrained(model_path, subfolder="text_encoder").to(torch.float16).to(torch.device("cuda"))
|
| 55 |
+
text_encoder_2 = CLIPTextModelWithProjection.from_pretrained(model_path, subfolder="text_encoder_2").to(torch.float16).to(torch.device("cuda"))
|
| 56 |
+
tokenizer = CLIPTokenizer.from_pretrained(model_path, subfolder="tokenizer")
|
| 57 |
+
tokenizer_2 = CLIPTokenizer.from_pretrained( model_path, subfolder="tokenizer_2")
|
| 58 |
+
|
| 59 |
+
unet = LyraDiffUNet2DConditionModel(is_sdxl=True)
|
| 60 |
+
vae = LyraDiffVaeModel(scaling_factor=0.13025, is_upcast=False)
|
| 61 |
+
|
| 62 |
+
unet.load_from_diffusers_model(os.path.join(model_path, "unet"))
|
| 63 |
+
vae.load_from_diffusers_model(vae_model_path)
|
| 64 |
+
|
| 65 |
+
scheduler = EulerAncestralDiscreteScheduler.from_pretrained(model_path, subfolder="scheduler", timestep_spacing="linspace")
|
| 66 |
+
|
| 67 |
+
pipe = StableDiffusionXLPipeline(
|
| 68 |
+
vae=vae,
|
| 69 |
+
unet=unet,
|
| 70 |
+
text_encoder=text_encoder,
|
| 71 |
+
text_encoder_2=text_encoder_2,
|
| 72 |
+
tokenizer=tokenizer,
|
| 73 |
+
tokenizer_2=tokenizer_2,
|
| 74 |
+
scheduler=scheduler
|
| 75 |
+
)
|
| 76 |
+
|
| 77 |
+
ip_ckpt = "/path/to/sdxl/ip_ckpt/ip-adapter-plus_sdxl_vit-h.bin"
|
| 78 |
+
image_encoder_path = "/path/to/sdxl/ip_ckpt/image_encoder"
|
| 79 |
+
|
| 80 |
+
# Create LyraIPAdapter
|
| 81 |
+
ip_adapter = LyraIPAdapter(unet_model=unet.model, sdxl=True, device=torch.device("cuda"), ip_ckpt=ip_ckpt, ip_plus=True, image_encoder_path=image_encoder_path, num_ip_tokens=16, ip_projection_dim=1024)
|
| 82 |
+
|
| 83 |
+
# Load the reference image for the IP-Adapter
|
| 84 |
+
ip_image = load_image("https://cdn-uploads.huggingface.co/production/uploads/6461b412846a6c8c8305319d/8U6yNHTPLaOC3gIWJZWGL.png")
|
| 85 |
+
ip_scale = 0.5
|
| 86 |
+
|
| 87 |
+
# get ip image embedding and pass it to the pipeline
|
| 88 |
+
ip_image_embedding = [ip_adapter.get_image_embeds_lyradiff(ip_image)['ip_hidden_states']]
|
| 89 |
+
# Set the IP-Adapter scale directly on the unet model object, since ip_adapter_scale cannot be set through the diffusers pipeline
|
| 90 |
+
unet.set_ip_adapter_scale(ip_scale)
|
| 91 |
+
|
| 92 |
+
for i in range(3):
|
| 93 |
+
generator = torch.Generator("cuda").manual_seed(123)
|
| 94 |
+
start = time.perf_counter()
|
| 95 |
+
images = pipe(prompt="a beautiful girl, cartoon style",
|
| 96 |
+
height=1024,
|
| 97 |
+
width=1024,
|
| 98 |
+
num_inference_steps=20,
|
| 99 |
+
num_images_per_prompt=1,
|
| 100 |
+
guidance_scale=7.5,
|
| 101 |
+
negative_prompt="NSFW",
|
| 102 |
+
generator=torch.Generator("cuda").manual_seed(123),
|
| 103 |
+
ip_adapter_image_embeds=ip_image_embedding
|
| 104 |
+
)[0]
|
| 105 |
+
images[0].save(f"sdxl_ip_{i}.png")
|
| 106 |
+
```
|
| 107 |
+
|
| 108 |
+
|
| 109 |
## Citation
|
| 110 |
``` bibtex
|
| 111 |
@Misc{lyraDiff_2025,
|