iitolstykh/VIBE-Image-Edit


iitolstykh committed 16 days ago

Commit 2fe57cc · verified · 1 Parent(s): 86bba3f

Update README.md

Files changed (1)

README.md +51 -0

@@ -47,6 +47,57 @@ library_name: diffusers
 - **High-Speed Inference:** Utilizes Sana1.5's linear attention mechanism for rapid generation.
 - **Multimodal Understanding:** Qwen3-VL ensures strong alignment between visual content and text instructions.
+## Inference Requirements
+- `vibe` library
+```bash
+pip install git+https://github.com/ai-forever/VIBE
+```
+- Pinned dependencies for the `vibe` library:
+```bash
+pip install transformers==4.57.1 torchvision==0.21.0 torch==2.6.0 diffusers==0.33.1 loguru==0.7.3
+```
+## Quick Start
+```python
+from PIL import Image
+import requests
+from io import BytesIO
+from huggingface_hub import snapshot_download
+from vibe.editor import ImageEditor
+# Download model
+model_path = snapshot_download(
+    repo_id="iitolstykh/VIBE-Image-Edit",
+    repo_type="model",
+)
+# Load model
+editor = ImageEditor(
+    checkpoint_path=model_path,
+    image_guidance_scale=1.2,
+    guidance_scale=4.5,
+    num_inference_steps=20,
+    device="cuda:0",
+)
+# Download test image
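+resp = requests.get('https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/3f58a82a-b4b4-40c3-a318-43f9350fcd02/original=true,quality=90/115610275.jpeg', timeout=30)
+resp.raise_for_status()  # fail early on an HTTP error instead of handing an error page to PIL
+image = Image.open(BytesIO(resp.content))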
+# Generate edited image
+edited_image = editor.generate_edited_image(
+    instruction="let this case swim in the river",
+    conditioning_image=image,
+    num_images_per_prompt=1,
+)[0]
+edited_image.save("edited_image.jpg", quality=100)
+```
 ## Citation
 If you use this model in your research or applications, please acknowledge the original projects:
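
Because the added requirements pin exact versions, it may help to confirm that the active environment matches them before running the quick start. The following is a minimal sketch and not part of the `vibe` library: the helper names `base_version` and `find_mismatches` are hypothetical, the pins are copied from the `pip install` line in this change, and treating a local build suffix such as `+cu124` as equivalent to the base version is an assumption.

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the README's pip command.
PINNED = {
    "transformers": "4.57.1",
    "torchvision": "0.21.0",
    "torch": "2.6.0",
    "diffusers": "0.33.1",
    "loguru": "0.7.3",
}


def base_version(v):
    """Drop a local build suffix such as '+cu124' (assumed to be irrelevant here)."""
    return v.split("+", 1)[0]


def find_mismatches(pinned, get_version=version):
    """Return {package: (expected, found)} for every pin that is not satisfied.

    `found` is None when the package is not installed.
    """
    mismatches = {}
    for name, expected in pinned.items():
        try:
            found = get_version(name)
        except PackageNotFoundError:
            found = None
        if found is None or base_version(found) != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    problems = find_mismatches(PINNED)
    for name, (expected, found) in problems.items():
        print(f"{name}: expected {expected}, found {found or 'not installed'}")
    if not problems:
        print("All pinned packages match.")
```

The `get_version` parameter exists only so the check can be exercised without touching the real environment; by default it reads installed package metadata through `importlib.metadata.version`.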