---
tags:
- image-to-video
- lora
- diffusers
- template:diffusion-lora
widget:
- text: >-
    The video opens on a cake. A knife, held by a hand, is coming into frame and
    hovering over the cake. The knife then begins cutting into the cake to c4k3
    cakeify it. As the knife slices the cake open, the inside of the cake is
    revealed to be cake with chocolate layers. The knife cuts through and the
    contents of the cake are revealed. Greek music playing in the background.
  output:
    url: video_examples/greek.mp4
- text: >-
    The video opens on a toy penguin. A knife, held by a hand, is coming into
    frame and hovering over the toy penguin. The knife then begins cutting into
    the toy penguin to c4k3 cakeify it. As the knife slices the toy penguin
    open, the inside of the toy penguin is revealed to be cake with chocolate
    layers. The knife cuts through and the contents of the toy penguin are
    revealed. The song: "Happy, happy penguin!" is sung in the background.
  output:
    url: video_examples/penguin.mp4
- text: >-
    The video opens on three game die. A knife, held by a hand, is coming into
    frame and hovering over the three game die. The knife then begins cutting
    into the three game die to c4k3 cakeify them. As the knife slices the three
    game die open, the insides of the three game die are revealed to be cakes
    with chocolate layers. The knife cuts through and the contents of the three
    game die are revealed.
  output:
    url: video_examples/die.mp4
- text: >-
    The video opens on a glass globe. A knife, held by a hand, is coming into
    frame and hovering over glass globe. The knife then begins cutting into the
    glass globe to c4k3 cakeify it. As the knife slices the glass globe open,
    the inside of the glass globe is revealed to be cake with chocolate layers.
    The knife cuts through and the contents of the glass globe are revealed.
  output:
    url: video_examples/ball.mp4
- text: >-
    The video opens on an anime girl. A knife, held by a hand, is coming into
    frame and hovering over the anime girl. The knife then begins cutting into
    the anime girl to c4k3 cakeify it. As the knife slices the anime girl open,
    the inside of the anime girl is revealed to be cake with chocolate layers.
    The knife cuts through and the contents of the anime girl are revealed.
  output:
    url: video_examples/fox.mp4
- text: >-
    The video opens on a castle. A knife, held by a hand, is coming into frame
    and hovering over the castle. The knife then begins cutting into the castle
    to c4k3 cakeify it. As the knife slices the castle open, the inside of the
    castle is revealed to be cake with chocolate layers. The knife cuts through
    and the contents of the castle are revealed.
  output:
    url: video_examples/castle.mp4
base_model: Lightricks/LTX-2
instance_prompt: c4k3 cakeify it
license: other
license_name: ltx-2-community-license-agreement
license_link: https://github.com/Lightricks/LTX-2/blob/main/LICENSE
datasets:
- finetrainers/cakeify-smol
language:
- en
---
# LTX-2 Cakeify

<Gallery />

## Model description 

We all know the potential of a model reveals itself only when LoRAs are trained. This is my *third* LTX-2 LoRA after [Hydraulic press](https://huggingface.co/kabachuha/ltx2-hydraulic-press) and [Inflate it](https://huggingface.co/kabachuha/ltx2-inflate-it). It continues the quest of porting some of the *classic* VFX LoRAs to LTX-2 (which has sound!).

This is the second attempt; the first one tuned only the attention layers. Here, following *inflate-it*, in addition to CREPA and TREAD, I also unfroze the **FFN** ("ff.net.0.proj", "ff.net.2"). I compensated for the resulting parameter growth by lowering the overall LoRA rank, so the size gain was not that large compared to *H-press* (830 vs 640 MB, and 1 GB on *inflate-it*). The LoRA was trained in **2 hours 41 minutes** (1400 steps) on a single 5090, which I consider medium complexity, and it is pretty robust to various object types. If the object is stubborn, replace its name in the prompt with "cake" and it will be easier for the model to understand it.
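The rank trade-off can be made concrete by counting adapter weights: a LoRA on a linear layer with `d_in` inputs and `d_out` outputs adds `rank * (d_in + d_out)` parameters. The sketch below is only an illustration. The layer widths and ranks are made-up values, not the real LTX-2 dimensions or the ranks used for this LoRA, and the helper names are hypothetical.

```python
def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Weights added by one LoRA adapter: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)


def block_params(layers, rank: int) -> int:
    """Total LoRA weights for a list of (d_in, d_out) linear layers."""
    return sum(lora_params(d_in, d_out, rank) for d_in, d_out in layers)


# Hypothetical transformer block shapes, NOT the actual LTX-2 sizes.
HIDDEN = 2048
FFN_INNER = 4 * HIDDEN

# Attention projections (q, k, v, out): four HIDDEN -> HIDDEN linears.
ATTENTION = [(HIDDEN, HIDDEN)] * 4
# The two feed-forward linears, matching the targets "ff.net.0.proj" and "ff.net.2".
FFN = [(HIDDEN, FFN_INNER), (FFN_INNER, HIDDEN)]

# Ranks 64 and 32 are assumptions chosen for illustration.
attention_only = block_params(ATTENTION, rank=64)
with_ffn_same_rank = block_params(ATTENTION + FFN, rank=64)
with_ffn_lower_rank = block_params(ATTENTION + FFN, rank=32)

print(f"attention only, rank 64:  {attention_only:,}")
print(f"attention + FFN, rank 64: {with_ffn_same_rank:,}")
print(f"attention + FFN, rank 32: {with_ffn_lower_rank:,}")
print(f"growth vs attention only: {with_ffn_lower_rank / attention_only:.3f}x")
```

With these made-up numbers, adding the FFN at the same rank more than doubles the adapter, while halving the rank brings it back to about 1.125x the attention-only size.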

The LoRA, in contrast to *H-press* or *inflate-it*, was trained on **pure, real videos** of objects that turned out to be cake after being cut in half.

The thing I love the most about LTX-2 is that you can set or generate any background music, be it a song or a Greek-style tune.

The SimpleTuner training and dataset configs are in `config.json` and `ltx2-multiresolution-cakeify-t2v.json`, respectively.

## Trigger words

You should use `c4k3 cakeify it` to trigger the video generation.

Also, turn on the sound, and either prompt for background audio or add your own!
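The example prompts on this card all follow the same sentence pattern, so a prompt for a new object can be assembled from a template. The helper below is a minimal sketch: the function name and its parameters are hypothetical, and the wording is copied from the example prompts rather than being a required format.

```python
TRIGGER = "c4k3 cakeify it"  # trigger phrase from this card


def cakeify_prompt(name: str, article: str = "a", audio: str = "") -> str:
    """Build a prompt in the style of the card's examples.

    name:    the object, e.g. "castle" (use "cake" if the object is stubborn)
    article: "a" or "an", used only in the opening sentence
    audio:   optional sentence describing the background sound
    """
    the = f"the {name}"
    parts = [
        f"The video opens on {article} {name}.",
        f"A knife, held by a hand, is coming into frame and hovering over {the}.",
        f"The knife then begins cutting into {the} to {TRIGGER}.",
        f"As the knife slices {the} open, the inside of {the} is revealed "
        "to be cake with chocolate layers.",
        f"The knife cuts through and the contents of {the} are revealed.",
    ]
    if audio:
        parts.append(audio)
    return " ".join(parts)


print(cakeify_prompt("castle"))
print(cakeify_prompt("anime girl", article="an",
                     audio="Greek music playing in the background."))
```

Plural subjects (such as the dice example) need "them" instead of "it" and plural verb forms, which this sketch does not handle.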

## Download model


[Download](/kabachuha/ltx2-cakeify/tree/main) the weights from the Files & versions tab.
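For scripted setups, the repository can also be fetched with the `huggingface_hub` library. This is a minimal sketch, assuming `huggingface_hub` is installed; the repo ID is taken from the download link above, while the function name and the default target folder are hypothetical.

```python
from pathlib import Path

REPO_ID = "kabachuha/ltx2-cakeify"  # from the download link on this card


def download_lora(local_dir: str = "ltx2-cakeify") -> Path:
    """Download the repository contents and return the local folder."""
    # Imported lazily so this module loads even without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return Path(snapshot_download(repo_id=REPO_ID, local_dir=local_dir))
```

Calling `download_lora()` pulls everything in the repository, including the example videos and the training configs; the LoRA weights can then be loaded from the returned folder by whichever LTX-2 pipeline or UI you use.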