Upload 2553102/2869279/README.md with huggingface_hub
Browse files- 2553102/2869279/README.md +110 -4
2553102/2869279/README.md
CHANGED
|
@@ -1,10 +1,116 @@
|
|
| 1 |
---
|
| 2 |
license: other
|
| 3 |
tags:
|
| 4 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 5 |
---
|
| 6 |
-
Author: [NRDX](https://civitai.red/user/NRDX)
|
| 7 |
|
| 8 |
-
|
| 9 |
|
| 10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: other
|
| 3 |
tags:
|
| 4 |
+
- add
|
| 5 |
+
- anything
|
| 6 |
+
- concept
|
| 7 |
+
- convert
|
| 8 |
+
- edit
|
| 9 |
+
- remove
|
| 10 |
+
- replace
|
| 11 |
---
|
|
|
|
| 12 |
|
| 13 |
+
# EditAnything - v1.0 - 2869279
|
| 14 |
|
| 15 |
+
**Model Type**: LORA
|
| 16 |
+
|
| 17 |
+
**Base Model**: LTXV 2.3
|
| 18 |
+
|
| 19 |
+
**Trigger Words**: Add a/an [subject/object] with [attributes], [location in the scene]., Remove the [subject/object] [location or identifying description]., Replace the [original subject/object] [location] with a/an [new subject/object] with [attributes]., Convert the video into a [style name] style.
|
| 20 |
+
|
| 21 |
+
**Tags**: add, anything, concept, convert, edit, remove, replace
|
| 22 |
+
|
| 23 |
+
## Gallery
|
| 24 |
+
|
| 25 |
+
<table>
|
| 26 |
+
<tr>
|
| 27 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/fb94aab1-476b-41f2-8b92-4371ce4f6785/original=true/127871225.mp4" width="200" controls muted autoplay loop></video></td>
|
| 28 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/ff5bb45e-2106-43d6-959a-2abaaa888947/original=true/127897280.mp4" width="200" controls muted autoplay loop></video></td>
|
| 29 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/e17830bc-adaa-454c-b9bb-105574937659/original=true/127899287.mp4" width="200" controls muted autoplay loop></video></td>
|
| 30 |
+
</tr>
|
| 31 |
+
<tr>
|
| 32 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/42e643f2-5fdd-4137-9cc6-26bb93d9be92/original=true/127871658.mp4" width="200" controls muted autoplay loop></video></td>
|
| 33 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/02c9ccc0-078e-4fc2-b9de-9f2cb8d362dc/original=true/127872455.mp4" width="200" controls muted autoplay loop></video></td>
|
| 34 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/a96e8960-8fa9-4228-972e-fdbfd236801e/original=true/127871875.mp4" width="200" controls muted autoplay loop></video></td>
|
| 35 |
+
</tr>
|
| 36 |
+
<tr>
|
| 37 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/177cdaaa-b23a-42fb-be89-a02a1f9c49da/original=true/127871228.mp4" width="200" controls muted autoplay loop></video></td>
|
| 38 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/1f14e8aa-a026-4797-a28f-366eee8ce423/original=true/127871270.mp4" width="200" controls muted autoplay loop></video></td>
|
| 39 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/f6ce85b7-51a2-47f7-b8b7-f791efab2cae/original=true/127872685.mp4" width="200" controls muted autoplay loop></video></td>
|
| 40 |
+
</tr>
|
| 41 |
+
<tr>
|
| 42 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/2e6923ec-7962-4ed7-9014-5b584377f4c3/original=true/127962336.mp4" width="200" controls muted autoplay loop></video></td>
|
| 43 |
+
</tr>
|
| 44 |
+
</table>
|
| 45 |
+
|
| 46 |
+
## Description
|
| 47 |
+
|
| 48 |
+
This model was trained on **8,000 video pairs**, and training is still ongoing for a few thousand more steps. It is still **experimental**, not trained with a fully professional production target, and the model may be updated unexpectedly as new checkpoints.
|
| 49 |
+
|
| 50 |
+
The current goal is not final polished production quality, but to explore:
|
| 51 |
+
|
| 52 |
+
* edit-anything behavior
|
| 53 |
+
* prompt-following
|
| 54 |
+
* inference tradeoffs
|
| 55 |
+
* synthetic dataset building, especially for **style data**
|
| 56 |
+
|
| 57 |
+
The model was trained around four main prompt patterns:
|
| 58 |
+
|
| 59 |
+
**Add**
|
| 60 |
+
`Add a/an [subject/object] with [clear visual attributes], [precise location in the scene].`
|
| 61 |
+
|
| 62 |
+
**Remove**
|
| 63 |
+
`Remove the [subject/object] [location or identifying description].`
|
| 64 |
+
|
| 65 |
+
**Replace**
|
| 66 |
+
`Replace the [original subject/object] [location] with a/an [new subject/object] with [clear visual attributes].`
|
| 67 |
+
|
| 68 |
+
**Convert / Style**
|
| 69 |
+
`Convert the video into a [style name] style.`
|
| 70 |
+
|
| 71 |
+
**Workflow URL:** `https://huggingface.co/Alissonerdx/LTX-LoRAs/blob/main/workflows/ltx23_edit_anything_v1.json`
|
| 72 |
+
|
| 73 |
+
One important thing during inference is **CFG**.
|
| 74 |
+
|
| 75 |
+
A good starting point is testing a **distilled setup with CFG = 1**. If the edit feels too weak or the model is not following the prompt well enough, increasing **CFG** can be the key. In some cases, increasing the **LoRA strength** to around **1.2** can also help.
|
| 76 |
+
|
| 77 |
+
The workflow is also **not fully optimized yet**. It still needs more testing to find the best combination of:
|
| 78 |
+
|
| 79 |
+
* CFG
|
| 80 |
+
* LoRA strength
|
| 81 |
+
* number of steps
|
| 82 |
+
* model combinations
|
| 83 |
+
|
| 84 |
+
It may also be interesting to combine this model with other models and see what kinds of results emerge.
|
| 85 |
+
|
| 86 |
+
If you can test it, please share your findings. Feedback on prompt behavior, edit strength, consistency, style transfer, and failure cases would be very helpful while training is still in progress.
|
| 87 |
+
|
| 88 |
+
Another very important thing is that the Removal task should have a very clear direction indicating where you want to remove what you want to remove.
|
| 89 |
+
|
| 90 |
+
Examples:
|
| 91 |
+
|
| 92 |
+
Remove the black robot sitting at the table.
|
| 93 |
+
|
| 94 |
+
Remove the person riding the electric scooter on the left.
|
| 95 |
+
|
| 96 |
+
Remove the person with glasses and the microphone in the foreground.
|
| 97 |
+
|
| 98 |
+
Remove the image of the green trees on the top left.
|
| 99 |
+
|
| 100 |
+
Remove the woman and the smoking bottle.
|
| 101 |
+
|
| 102 |
+
For example, if the object are in front, use foreground ... background, left, right, top, bottom.
|
| 103 |
+
|
| 104 |
+
Another way to remove things that don't want to be removed is to simply add a mask, for example magenta, over the object you want to remove, and use this video as a guide. When writing the prompt, you write something like: "Remove object masked with the pink color." Sometimes this is much more precise than waiting for it to recognize what actually needs to be removed, because in this case the biggest indicator is the magenta-colored object.
|
| 105 |
+
|
| 106 |
+
[**If this model was helpful to you in any way, please consider helping me continue creating more model for the price of a coffee.**](https://buymeacoffee.com/nrdx)
|
| 107 |
+
|
| 108 |
+
---
|
| 109 |
+
|
| 110 |
+
Author: [NRDX](https://civitai.com/user/NRDX)
|
| 111 |
+
|
| 112 |
+
Model: [CivitAI Model Page](https://civitai.com/models/2553102?modelVersionId=2869279)
|
| 113 |
+
|
| 114 |
+
Archive: [CivArchive Page](https://civarchive.com/models/2553102?modelVersionId=2869279)
|
| 115 |
+
|
| 116 |
+
<!-- Version: 20260424_all_update -->
|