Commit ebfa33a (verified) · 0 parent(s)
initial commit
Files changed:
- .gitattributes +55 -0
- README.md +88 -0
.gitattributes ADDED
@@ -0,0 +1,55 @@
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.lz4 filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+# Audio files - uncompressed
+*.pcm filter=lfs diff=lfs merge=lfs -text
+*.sam filter=lfs diff=lfs merge=lfs -text
+*.raw filter=lfs diff=lfs merge=lfs -text
+# Audio files - compressed
+*.aac filter=lfs diff=lfs merge=lfs -text
+*.flac filter=lfs diff=lfs merge=lfs -text
+*.mp3 filter=lfs diff=lfs merge=lfs -text
+*.ogg filter=lfs diff=lfs merge=lfs -text
+*.wav filter=lfs diff=lfs merge=lfs -text
+# Image files - uncompressed
+*.bmp filter=lfs diff=lfs merge=lfs -text
+*.gif filter=lfs diff=lfs merge=lfs -text
+*.png filter=lfs diff=lfs merge=lfs -text
+*.tiff filter=lfs diff=lfs merge=lfs -text
+# Image files - compressed
+*.jpg filter=lfs diff=lfs merge=lfs -text
+*.jpeg filter=lfs diff=lfs merge=lfs -text
+*.webp filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,88 @@
+---
+tags:
+- text-to-image
+- lora
+- diffusers
+- template:diffusion-lora
+widget:
+- output:
+    url: images/LTX2IV_00236-audio.webp
+  text: >-
+    The video opens on a cake. A knife, held by a hand, is coming into frame and
+    hovering over the cake. The knife then begins cutting into the cake to c4k3
+    cakeify it. As the knife slices the cake open, the inside of the cake is
+    revealed to be cake with chocolate layers. The knife cuts through and the
+    contents of the cake are revealed.
+- output:
+    url: images/LTX2IV_00233-audio.webp
+  text: >-
+    The video opens on a cake. A knife, held by a hand, is coming into frame and
+    hovering over the cake. The knife then begins cutting into the cake to c4k3
+    cakeify it. As the knife slices the cake open, the inside of the cake is
+    revealed to be cake with chocolate layers. The knife cuts through and the
+    contents of the cake are revealed.
+- output:
+    url: images/LTX2IV_00229-audio.webp
+  text: >-
+    The video opens on a cake. A knife, held by a hand, is coming into frame and
+    hovering over the cake. The knife then begins cutting into the cake to c4k3
+    cakeify it. As the knife slices the cake open, the inside of the cake is
+    revealed to be cake with chocolate layers. The knife cuts through and the
+    contents of the cake are revealed.
+- output:
+    url: images/LTX2IV_00221-audio.webp
+  text: >-
+    The video opens on a cake. A knife, held by a hand, is coming into frame and
+    hovering over the cake. The knife then begins cutting into the cake to c4k3
+    cakeify it. As the knife slices the cake open, the inside of the cake is
+    revealed to be cake with chocolate layers. The knife cuts through and the
+    contents of the cake are revealed.
+- output:
+    url: images/LTX2IV_00215-audio.webp
+  text: >-
+    The video opens on a cake. A knife, held by a hand, is coming into frame and
+    hovering over the cake. The knife then begins cutting into the cake to c4k3
+    cakeify it. As the knife slices the cake open, the inside of the cake is
+    revealed to be cake with chocolate layers. The knife cuts through and the
+    contents of the cake are revealed.
+- output:
+    url: images/LTX2IV_00210-audio.webp
+  text: >-
+    The video opens on a cake. A knife, held by a hand, is coming into frame and
+    hovering over the cake. The knife then begins cutting into the cake to c4k3
+    cakeify it. As the knife slices the cake open, the inside of the cake is
+    revealed to be cake with chocolate layers. The knife cuts through and the
+    contents of the cake are revealed.
+base_model: Lightricks/LTX-2
+instance_prompt: c4k3 cakeify it
+license: other
+license_name: ltx-2-community-license-agreement
+license_link: https://github.com/Lightricks/LTX-2/blob/main/LICENSE
+---
+# LTX-2 Cakeify
+
+<Gallery />
+
+## Model description
+
+# LTX-2 Cakeify
+
+We all know the potential of a model reveals itself only when LoRAs are trained. This is my *third* LTX-2 LoRA after [Hydraulic press](https://huggingface.co/kabachuha/ltx2-hydraulic-press) and [Inflate it](https://huggingface.co/kabachuha/ltx2-inflate-it). It continues the quest of porting some of the *classic* VFX LoRAs to LTX-2 (which has sound!).
+
+This is the second attempt; the first one tuned only the attention layers. Here, following *inflate-it*, in addition to CREPA and TREAD, I also unfroze the **FFN** ("ff.net.0.proj", "ff.net.2"). I compensated for the resulting parameter growth by lowering the overall LoRA rank, so the size gain was not that large compared to *H-press* (800 vs 640 MB, and 1 GB for *inflate-it*). The LoRA was trained in **2 hours 41 minutes** (1400 steps) on a single 5090, which I would call medium complexity, and it is pretty robust to various object types. If an object is stubborn, replace its name in the prompt with "cake" and the model will understand it more easily.
+
+In contrast to *H-press* and *inflate-it*, this LoRA was trained on **pure, real videos** of objects that turned out to be cake after being cut in half.
+
+The thing I love most about LTX-2 is that you can set or generate any background music, be it a song or a Greek-style tune.
+
+The SimpleTuner training and dataset configs are in `config.json` and `ltx2-multiresolution-inflate-t2v.json`, respectively.
+
+## Trigger words
+
+You should use `c4k3 cakeify it` to trigger the video generation.
+
+
+## Download model
+
+
+[Download](/kabachuha/ltx2-cakeify/tree/main) the model files in the Files & versions tab.
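The model description in the README says that unfreezing the FFN projections was offset by lowering the LoRA rank. A rough way to see why that works: a LoRA adapter on a linear layer of shape `d_in x d_out` adds `rank * (d_in + d_out)` parameters, so adapter size scales linearly with rank. The sketch below is only an illustration under assumed numbers. The hidden size, FFN multiplier, block count, ranks, and the attention module names (`to_q`, `to_k`, `to_v`, `to_out.0`) are all hypothetical and are not taken from LTX-2 or from the author's `config.json`; only `ff.net.0.proj` and `ff.net.2` come from the README.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Linear:
    """Shape of a linear layer that a LoRA adapter is attached to."""
    name: str
    d_in: int
    d_out: int


def lora_params(layer: Linear, rank: int) -> int:
    # LoRA adds two low-rank matrices: A (rank x d_in) and B (d_out x rank).
    return rank * (layer.d_in + layer.d_out)


def adapter_params(layers: list[Linear], rank: int, num_blocks: int) -> int:
    # Every transformer block carries its own copy of each adapted layer.
    return num_blocks * sum(lora_params(layer, rank) for layer in layers)


# Hypothetical dimensions for illustration only, NOT the real LTX-2 values.
HIDDEN = 4096
FFN_MULT = 4
NUM_BLOCKS = 48

# Attention projection names are assumed; the FFN names are from the README.
ATTENTION = [Linear(n, HIDDEN, HIDDEN) for n in ("to_q", "to_k", "to_v", "to_out.0")]
FFN = [
    Linear("ff.net.0.proj", HIDDEN, FFN_MULT * HIDDEN),
    Linear("ff.net.2", FFN_MULT * HIDDEN, HIDDEN),
]

# Hypothetical ranks: attention-only at rank 64, attention + FFN at 64 and 32.
attn_only = adapter_params(ATTENTION, rank=64, num_blocks=NUM_BLOCKS)
attn_ffn_same_rank = adapter_params(ATTENTION + FFN, rank=64, num_blocks=NUM_BLOCKS)
attn_ffn_lower_rank = adapter_params(ATTENTION + FFN, rank=32, num_blocks=NUM_BLOCKS)

print(f"attention only, rank 64:  {attn_only:,}")
print(f"attention + FFN, rank 64: {attn_ffn_same_rank:,}")
print(f"attention + FFN, rank 32: {attn_ffn_lower_rank:,}")
print(f"growth at lower rank:     {attn_ffn_lower_rank / attn_only:.3f}x")
```

With these assumed shapes, adding the FFN at the same rank more than doubles the adapter, while halving the rank brings the growth down to a small fraction. The real ratio depends on the actual layer shapes and ranks used in training.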