rmsandu committed on
Commit fd4b2eb · verified · 1 Parent(s): 8a48740

Update Readme

Files changed (1): README.md (+39, −34)

README.md CHANGED
---
tags:
- lora
- diffusers
- template:diffusion-lora
- flux
widget:
- text: >-
    "[FOUR-VIEWS] This set of four images show different angles of a light blue

    photo shows a back view of the bag."
  output:
    url: images/composite_example.jpeg
- text: >-
    [FOUR-VIEWS] a red desk lamp from multiple views;[TOP-LEFT] This photo shows
    a 45-degree angle of desk lamp;[TOP-RIGHT] This photo shows a high-angle

base_model: black-forest-labs/FLUX.1-dev
instance_prompt: '[FOUR-VIEWS]'
license: apache-2.0
pipeline_tag: text-to-image
---

# fourviews-incontext-lora
## Model description

Inspired by [In-Context-LoRA](https://github.com/ali-vilab/In-Context-LoRA), this project generates four multi-view images of the same scene or object simultaneously. By using FLUX with the fourviews-incontext-lora, the generated 2 × 2 composite can be divided into quadrants to obtain novel views.

> **_NOTE:_** This is a beta release of the model. The consistency between views may not be perfect, and the model might sometimes generate views that don't perfectly align or maintain exact object positions across viewpoints.

## `[FOUR-VIEWS]` 2 × 2-Grid LoRA

**Base:** FLUX.1-dev
**Images:** 126 custom composites
**Steps:** 800 (≈ 12.7 epochs)
**Rank:** 8
**Trigger token:** `[FOUR-VIEWS]`
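As a quick sanity check, the step and epoch figures above are consistent with a per-step batch size of 2 — the batch size itself is not stated in the card, so that is an assumption:

```python
# Rough check of the training stats above.
# batch_size = 2 is our assumption, not stated in the model card.
num_images = 126
train_steps = 800
batch_size = 2

steps_per_epoch = num_images / batch_size  # 63 steps to see every image once
epochs = train_steps / steps_per_epoch
print(round(epochs, 1))  # → 12.7
```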

```python
import torch
from diffusers import FluxPipeline

# Load the base model and apply the LoRA weights
pipeline = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    torch_dtype=torch.bfloat16,
).to("cuda")
pipeline.load_lora_weights(
    "rmsandu/fourviews-incontext-lora",
    weight_name="twoview-incontext-b03.safetensors",
)
pipeline.fuse_lora()

# One overview sentence plus one clause per quadrant tag
prompt = (
    "[FOUR-VIEWS] This set of four images shows a jade dragon statue from "
    "different viewpoints. [TOP-LEFT] This photo shows a 45-degree angle of "
    "the jade statue; [TOP-RIGHT] This photo shows a high-angle shot of the "
    "statue; [BOTTOM-LEFT] Here is a side view shot of the statue; "
    "[BOTTOM-RIGHT] The back view of the statue."
)
output = pipeline(
    prompt=prompt,
    height=512,
    width=512,
    num_inference_steps=30,
    guidance_scale=3.5,
).images[0]

output.save("fourview-incontext-beta.png")
```
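The description above mentions dividing the composite into portions to obtain the individual views. A minimal post-processing sketch of that step — the `split_four_views` helper is our own illustration, not part of this repo — following the [TOP-LEFT]/[TOP-RIGHT]/[BOTTOM-LEFT]/[BOTTOM-RIGHT] layout:

```python
from PIL import Image

def split_four_views(composite: Image.Image) -> dict[str, Image.Image]:
    """Crop a 2x2 composite into its four quadrant views."""
    w, h = composite.size
    half_w, half_h = w // 2, h // 2
    return {
        # PIL crop boxes are (left, upper, right, lower)
        "top_left": composite.crop((0, 0, half_w, half_h)),
        "top_right": composite.crop((half_w, 0, w, half_h)),
        "bottom_left": composite.crop((0, half_h, half_w, h)),
        "bottom_right": composite.crop((half_w, half_h, w, h)),
    }

# Example: a 1024x1024 composite yields four 512x512 views
views = split_four_views(Image.new("RGB", (1024, 1024)))
print({name: view.size for name, view in views.items()})
```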
## Trigger words

You should use `[FOUR-VIEWS]` to trigger the image generation.
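The prompts in this card follow a fixed pattern: the trigger token, an overview sentence, then one clause per quadrant tag. A small helper that assembles such prompts — `build_four_views_prompt` is illustrative, not part of the repo:

```python
def build_four_views_prompt(subject: str, views: dict[str, str]) -> str:
    """Assemble a [FOUR-VIEWS] prompt from one description per quadrant tag."""
    overview = (
        f"[FOUR-VIEWS] This set of four images shows {subject} "
        "from different viewpoints. "
    )
    # Quadrant order matches the card's examples
    clauses = [
        f"[{tag}] {views[tag]}"
        for tag in ("TOP-LEFT", "TOP-RIGHT", "BOTTOM-LEFT", "BOTTOM-RIGHT")
    ]
    return overview + "; ".join(clauses) + "."
```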

## Download model

Weights for this model are available in Safetensors format.

[Download](/rmsandu/fourviews-incontext-lora/tree/main) them in the Files & versions tab.