Update README.md

5589702 verified 3 months ago

5.5 kB

	---
	tags:
	- text-to-image
	- lora
	- diffusers
	- template:diffusion-lora
	base_model: black-forest-labs/FLUX.1-Kontext-dev
	instance_prompt: >-
	[photo content], render the image from the left-side perspective, keeping
	consistent lighting, textures, and proportions. Maintain the realism of all
	surrounding elements while revealing previously unseen left-side details
	consistent with the object’s or scene’s structure.
	license: other
	license_name: flux-1-dev-non-commercial-license
	license_link: LICENSE.md
	language:
	- en
	pipeline_tag: image-to-image
	library_name: diffusers
	---

	![1](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/rXkxuP7K8JTRXHGdAzsk2.png)

	# Kontext-CAM-Left-View

	The Kontext-CAM-Left-View is an experimental adapter for black-forest-lab's FLUX.1-Kontext-dev, designed to generate a left-side perspective of the scene while preserving consistent lighting, textures, and proportions. The model maintains the realism of all surrounding elements and accurately reveals previously unseen left-side details, ensuring seamless perspective alignment and environmental coherence. It was trained on 800 image pairs (400 start images and 400 end images) to deliver high-fidelity, geometry-consistent left-side viewpoint generation.

	> [!note]
	[photo content], render the image from the left-side perspective, keeping consistent lighting, textures, and proportions. Maintain the realism of all surrounding elements while revealing previously unseen left-side details consistent with the object’s or scene’s structure.

	> You modified the prompt, altering its properties and subjective elements. Note: this is an experimental adapter and may contain artifacts.

	---

	## Sample Inferences : Demo

	<table style="width:100%; border-collapse:collapse;">
	<tr>
	<td style="width:50%; text-align:center;">
	<img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/lZ8asnkoamFUH1ClFgn6H.jpeg"
	alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/>
	</td>
	<td style="width:50%; text-align:center;">
	<img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/F92WuRNLReDYS-nXXBLUz.webp"
	alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/>
	</td>
	</tr>
	</table>

	<table style="width:100%; border-collapse:collapse;">
	<tr>
	<td style="width:50%; text-align:center;">
	<img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/Txk4Mnk7q6wkGFdpe276J.jpeg"
	alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/>
	</td>
	<td style="width:50%; text-align:center;">
	<img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/uHdGFGI-4plezer-JSAjs.webp"
	alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/>
	</td>
	</tr>
	</table>

	---

	## Parameter Settings

	\| Setting \| Value \|
	\| ------------------------ \| ------------------------ \|
	\| Module Type \| Adapter \|
	\| Base Model \| FLUX.1 Kontext Dev - fp8 \|
	\| Trigger Words \| [photo content], render the image from the left-side perspective, keeping consistent lighting, textures, and proportions. Maintain the realism of all surrounding elements while revealing previously unseen left-side details consistent with the object’s or scene’s structure. \|
	\| Image Processing Repeats \| 42 \|
	\| Epochs \| 22 \|
	\| Save Every N Epochs \| 1 \|

	Labeling: DeepCaption-VLA-7B(natural language & English)

	Total Images Used for Training : 800 Image Pairs (400 Start, 400 End)

	## Training Parameters

	\| Setting \| Value \|
	\| --------------------------- \| --------- \|
	\| Seed \| - \|
	\| Clip Skip \| - \|
	\| Text Encoder LR \| 0.00001 \|
	\| UNet LR \| 0.00005 \|
	\| LR Scheduler \| constant \|
	\| Optimizer \| AdamW8bit \|
	\| Network Dimension \| 64 \|
	\| Network Alpha \| 32 \|
	\| Gradient Accumulation Steps \| - \|

	## Label Parameters

	\| Setting \| Value \|
	\| --------------- \| ----- \|
	\| Shuffle Caption \| - \|
	\| Keep N Tokens \| - \|

	## Advanced Parameters

	\| Setting \| Value \|
	\| ------------------------- \| ----- \|
	\| Noise Offset \| 0.03 \|
	\| Multires Noise Discount \| 0.1 \|
	\| Multires Noise Iterations \| 10 \|
	\| Conv Dimension \| - \|
	\| Conv Alpha \| - \|
	\| Batch Size \| - \|
	\| Steps \| 3300 & 400(warm up) \|
	\| Sampler \| euler \|

	---

	## Trigger words

	You should use `[photo content]` to trigger the image generation.

	You should use `render the image from the left-side perspective` to trigger the image generation.

	You should use `keeping consistent lighting` to trigger the image generation.

	You should use `textures` to trigger the image generation.

	You should use `and proportions. Maintain the realism of all surrounding elements while revealing previously unseen left-side details consistent with the object’s or scene’s structure.` to trigger the image generation.

	## Download model

	[Download](/prithivMLmods/Kontext-CAM-Left-View/tree/main) them in the Files & versions tab.