Update README.md
README.md
CHANGED
|
@@ -14,15 +14,70 @@ license: apache-2.0
|
|
| 14 |
language:
|
| 15 |
- en
|
| 16 |
pipeline_tag: image-to-image
|
| 17 |
-
library_name:
|
| 18 |
---
|
| 19 |
-
# Kontext-CAM-Right-View
|
| 20 |
|
| 21 |
The Kontext-CAM-Right-View is an experimental adapter for black-forest-labs' FLUX.1-Kontext-dev, designed to transform scenes into a right-side camera perspective while preserving natural lighting, accurate geometry, and realistic textures. The model maintains harmony with the original environment, shadows, and visual tone, ensuring smooth perspective transitions and authentic scene composition. It was trained on 800 image pairs (400 start images and 400 end images) to achieve precise, context-aware right-side viewpoint generation.
|
| 22 |
|
| 23 |
> [!note]
|
| 24 |
[photo content], generate the right-side perspective of the scene. Ensure natural lighting, accurate geometry, and realistic textures. Maintain harmony with the original image’s environment, shadows, and visual tone while providing the right-side visual continuation.
|
| 25 |
|
| 26 |
## Trigger words
|
| 27 |
|
| 28 |
You should use `[photo content]` to trigger the image generation.
|
|
|
|
| 14 |
language:
|
| 15 |
- en
|
| 16 |
pipeline_tag: image-to-image
|
| 17 |
+
library_name: diffusers
|
| 18 |
---
|
| 19 |
+
# **Kontext-CAM-Right-View**
|
| 20 |
|
| 21 |
The Kontext-CAM-Right-View is an experimental adapter for black-forest-labs' FLUX.1-Kontext-dev, designed to transform scenes into a right-side camera perspective while preserving natural lighting, accurate geometry, and realistic textures. The model maintains harmony with the original environment, shadows, and visual tone, ensuring smooth perspective transitions and authentic scene composition. It was trained on 800 image pairs (400 start images and 400 end images) to achieve precise, context-aware right-side viewpoint generation.
|
| 22 |
|
| 23 |
> [!note]
|
| 24 |
[photo content], generate the right-side perspective of the scene. Ensure natural lighting, accurate geometry, and realistic textures. Maintain harmony with the original image’s environment, shadows, and visual tone while providing the right-side visual continuation.
|
| 25 |
|
| 26 |
+
---
|
| 27 |
+
|
| 28 |
+
|
| 29 |
+
## Parameter Settings
|
| 30 |
+
|
| 31 |
+
| Setting | Value |
|
| 32 |
+
| ------------------------ | ------------------------ |
|
| 33 |
+
| Module Type | Adapter |
|
| 34 |
+
| Base Model | FLUX.1 Kontext Dev - fp8 |
|
| 35 |
+
| Trigger Words | [photo content], generate the right-side perspective of the scene. Ensure natural lighting, accurate geometry, and realistic textures. Maintain harmony with the original image’s environment, shadows, and visual tone while providing the right-side visual continuation. |
|
| 36 |
+
| Image Processing Repeats | 43 |
|
| 37 |
+
| Epochs | 23 |
|
| 38 |
+
| Save Every N Epochs | 1 |
|
| 39 |
+
|
| 40 |
+
Labeling: DeepCaption-VLA-7B (natural language & English)
|
| 41 |
+
|
| 42 |
+
Total Images Used for Training: 800 Image Pairs (400 Start, 400 End)
|
| 43 |
+
|
| 44 |
+
## Training Parameters
|
| 45 |
+
|
| 46 |
+
| Setting | Value |
|
| 47 |
+
| --------------------------- | --------- |
|
| 48 |
+
| Seed | - |
|
| 49 |
+
| Clip Skip | - |
|
| 50 |
+
| Text Encoder LR | 0.00001 |
|
| 51 |
+
| UNet LR | 0.00005 |
|
| 52 |
+
| LR Scheduler | constant |
|
| 53 |
+
| Optimizer | AdamW8bit |
|
| 54 |
+
| Network Dimension | 64 |
|
| 55 |
+
| Network Alpha | 32 |
|
| 56 |
+
| Gradient Accumulation Steps | - |
|
| 57 |
+
|
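As a rough reading of the Network Dimension and Network Alpha rows above, and assuming the adapter is a LoRA-style module (the card only says "Adapter"), the dimension is the rank $r$ and alpha is the scaling constant $\alpha$. Under the usual LoRA convention, the low-rank update $BA$ is scaled by $\alpha / r$ before being added to the frozen weight:

```latex
\Delta W = \frac{\alpha}{r}\, B A,
\qquad
\frac{\alpha}{r} = \frac{32}{64} = 0.5
```

If that convention applies here, the learned update is applied at half strength relative to an unscaled $BA$.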
| 58 |
+
## Label Parameters
|
| 59 |
+
|
| 60 |
+
| Setting | Value |
|
| 61 |
+
| --------------- | ----- |
|
| 62 |
+
| Shuffle Caption | - |
|
| 63 |
+
| Keep N Tokens | - |
|
| 64 |
+
|
| 65 |
+
## Advanced Parameters
|
| 66 |
+
|
| 67 |
+
| Setting | Value |
|
| 68 |
+
| ------------------------- | ----- |
|
| 69 |
+
| Noise Offset | 0.03 |
|
| 70 |
+
| Multires Noise Discount | 0.1 |
|
| 71 |
+
| Multires Noise Iterations | 10 |
|
| 72 |
+
| Conv Dimension | - |
|
| 73 |
+
| Conv Alpha | - |
|
| 74 |
+
| Batch Size | - |
|
| 75 |
+
| Steps | 3200 & 300 (warm-up) |
|
| 76 |
+
| Sampler | euler |
|
| 77 |
+
|
| 78 |
+
---
|
| 79 |
+
|
| 80 |
+
|
| 81 |
## Trigger words
|
| 82 |
|
| 83 |
You should use `[photo content]` to trigger the image generation.
|
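Since the card sets `library_name: diffusers`, the trigger phrase can be used roughly as follows. This is a minimal sketch rather than an official recipe: it assumes a recent `diffusers` release that ships `FluxKontextPipeline`, assumes the adapter is stored as LoRA-format weights, and uses a hypothetical repository id (`your-namespace/Kontext-CAM-Right-View`) and an illustrative `guidance_scale` that do not come from this card.

```python
# Prompt text taken from the model card's note; "[photo content]" is the trigger.
TRIGGER = "[photo content]"
PROMPT = (
    f"{TRIGGER}, generate the right-side perspective of the scene. "
    "Ensure natural lighting, accurate geometry, and realistic textures. "
    "Maintain harmony with the original image’s environment, shadows, and "
    "visual tone while providing the right-side visual continuation."
)

# Assumption: hypothetical repository id; replace with the real adapter repo.
ADAPTER_REPO = "your-namespace/Kontext-CAM-Right-View"


def generate_right_view(image_path: str, output_path: str = "right_view.png") -> str:
    """Load the base pipeline, attach the adapter, and save one edited image."""
    # Heavy imports stay local so the constants above remain importable
    # on machines without torch/diffusers installed.
    import torch
    from diffusers import FluxKontextPipeline
    from diffusers.utils import load_image

    pipe = FluxKontextPipeline.from_pretrained(
        "black-forest-labs/FLUX.1-Kontext-dev", torch_dtype=torch.bfloat16
    )
    pipe.load_lora_weights(ADAPTER_REPO)  # assumes LoRA-format adapter weights
    pipe.to("cuda")

    # guidance_scale is an illustrative value, not a setting from this card.
    result = pipe(image=load_image(image_path), prompt=PROMPT, guidance_scale=2.5)
    result.images[0].save(output_path)
    return output_path
```

Calling `generate_right_view("scene.png")` would need a CUDA GPU and access to the base model weights; the input file name is only an example.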