Update README.md
Browse files
README.md
CHANGED
|
@@ -4,10 +4,6 @@ tags:
|
|
| 4 |
- lora
|
| 5 |
- diffusers
|
| 6 |
- template:diffusion-lora
|
| 7 |
-
widget:
|
| 8 |
-
- output:
|
| 9 |
-
url: images/eeeeeeeeeee.png
|
| 10 |
-
text: '-'
|
| 11 |
base_model: black-forest-labs/FLUX.1-Kontext-dev
|
| 12 |
instance_prompt: >-
|
| 13 |
[photo content], recreate the scene from a bottom-up perspective. Preserve
|
|
@@ -15,11 +11,70 @@ instance_prompt: >-
|
|
| 15 |
background sky or floor elements adjust naturally to the new angle,
|
| 16 |
maintaining authentic shadowing and perspective.
|
| 17 |
license: apache-2.0
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
---
|
| 19 |
-
# Kontext-Bottom-Up-View
|
| 20 |
|
| 21 |
-
|
| 22 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 23 |
|
| 24 |
## Trigger words
|
| 25 |
|
|
@@ -33,8 +88,6 @@ You should use `and lighting direction to enhance realism. Ensure the background
|
|
| 33 |
|
| 34 |
You should use `maintaining authentic shadowing and perspective.` to trigger the image generation.
|
| 35 |
|
| 36 |
-
|
| 37 |
## Download model
|
| 38 |
|
| 39 |
-
|
| 40 |
-
[Download](/prithivMLmods/Kontext-Bottom-Up-View/tree/main) them in the Files & versions tab.
|
|
|
|
| 4 |
- lora
|
| 5 |
- diffusers
|
| 6 |
- template:diffusion-lora
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
base_model: black-forest-labs/FLUX.1-Kontext-dev
|
| 8 |
instance_prompt: >-
|
| 9 |
[photo content], recreate the scene from a bottom-up perspective. Preserve
|
|
|
|
| 11 |
background sky or floor elements adjust naturally to the new angle,
|
| 12 |
maintaining authentic shadowing and perspective.
|
| 13 |
license: apache-2.0
|
| 14 |
+
language:
|
| 15 |
+
- en
|
| 16 |
+
pipeline_tag: image-to-image
|
| 17 |
+
library_name: transformers
|
| 18 |
---
|
| 19 |
+
# **Kontext-Bottom-Up-View**
|
| 20 |
|
| 21 |
+
The Kontext-Bottom-Up-View is an experimental adapter for black-forest-lab's FLUX.1-Kontext-dev, designed to transform scenes into a bottom-up perspective, preserving accurate depth, scale, and lighting direction to enhance overall realism. The model ensures that background elements such as sky or ground surfaces adjust naturally to the new viewing angle, maintaining coherent geometry, texture consistency, and visual balance. It was trained on 800 image pairs (400 start images and 400 end images) to achieve precise, geometry-consistent bottom-up scene generation.
|
| 22 |
|
| 23 |
+
> [!note]
|
| 24 |
+
[photo content], recreate the scene from a bottom-up perspective. Preserve accurate depth, scale, and lighting direction to enhance realism. Ensure the background sky or floor elements adjust naturally to the new angle, maintaining authentic shadowing and perspective.
|
| 25 |
+
|
| 26 |
+
---
|
| 27 |
+
|
| 28 |
+
## Parameter Settings
|
| 29 |
+
|
| 30 |
+
| Setting | Value |
|
| 31 |
+
| ------------------------ | ------------------------ |
|
| 32 |
+
| Module Type | Adapter |
|
| 33 |
+
| Base Model | FLUX.1 Kontext Dev - fp8 |
|
| 34 |
+
| Trigger Words | [photo content], upscale the low-quality image to 4K resolution, enhancing sharpness, clarity, and fine details while preserving the original texture, colors, lighting, and natural appearance. Remove noise, blur, and compression artifacts without over-smoothing or distorting facial or object features. Ensure realistic depth, balanced contrast, and accurate tones, achieving a high-definition, lifelike result that maintains the integrity of the original image. |
|
| 35 |
+
| Image Processing Repeats | 45 |
|
| 36 |
+
| Epochs | 24 |
|
| 37 |
+
| Save Every N Epochs | 1 |
|
| 38 |
+
|
| 39 |
+
Labeling: DeepCaption-VLA-7B(natural language & English)
|
| 40 |
+
|
| 41 |
+
Total Images Used for Training : 800 Image Pairs (400 Start, 400 End)
|
| 42 |
+
|
| 43 |
+
## Training Parameters
|
| 44 |
+
|
| 45 |
+
| Setting | Value |
|
| 46 |
+
| --------------------------- | --------- |
|
| 47 |
+
| Seed | - |
|
| 48 |
+
| Clip Skip | - |
|
| 49 |
+
| Text Encoder LR | 0.00001 |
|
| 50 |
+
| UNet LR | 0.00005 |
|
| 51 |
+
| LR Scheduler | constant |
|
| 52 |
+
| Optimizer | AdamW8bit |
|
| 53 |
+
| Network Dimension | 64 |
|
| 54 |
+
| Network Alpha | 32 |
|
| 55 |
+
| Gradient Accumulation Steps | - |
|
| 56 |
+
|
| 57 |
+
## Label Parameters
|
| 58 |
+
|
| 59 |
+
| Setting | Value |
|
| 60 |
+
| --------------- | ----- |
|
| 61 |
+
| Shuffle Caption | - |
|
| 62 |
+
| Keep N Tokens | - |
|
| 63 |
+
|
| 64 |
+
## Advanced Parameters
|
| 65 |
+
|
| 66 |
+
| Setting | Value |
|
| 67 |
+
| ------------------------- | ----- |
|
| 68 |
+
| Noise Offset | 0.03 |
|
| 69 |
+
| Multires Noise Discount | 0.1 |
|
| 70 |
+
| Multires Noise Iterations | 10 |
|
| 71 |
+
| Conv Dimension | - |
|
| 72 |
+
| Conv Alpha | - |
|
| 73 |
+
| Batch Size | - |
|
| 74 |
+
| Steps | 3500 & 400(warm up) |
|
| 75 |
+
| Sampler | euler |
|
| 76 |
+
|
| 77 |
+
---
|
| 78 |
|
| 79 |
## Trigger words
|
| 80 |
|
|
|
|
| 88 |
|
| 89 |
You should use `maintaining authentic shadowing and perspective.` to trigger the image generation.
|
| 90 |
|
|
|
|
| 91 |
## Download model
|
| 92 |
|
| 93 |
+
[Download](/prithivMLmods/Kontext-Bottom-Up-View/tree/main) them in the Files & versions tab.
|
|
|