LTX-2 Hydraulic press trained at home in just 1.5 hours!

We all know the potential of a model reveals itself only when LoRAs are trained. However, when I first trained LTX-2, even after 11 hours, the results were subpar. Then, at day, I activated CREPA https://arxiv.org/pdf/2506.09229, which is extending the known REPA (representation alignment) technique for videos, and which has also helped me much with Kandinsky 5 trainability struggles. With CREPA and Prodigy as the optimizer (and TREAD, but I also used it for the failed training run), musubi block swap at 2, CREPA using backbone features, the iteration speed was 5.75s/it using SimpleTuner on a single 5090. The training VRAM usage is 30.4 GB.

Note, that this is not yet convergence (~90/100 of the training loss curve saturation), this is a "good enough" result to show the training speedup, because the results are diminishing later. Also, trained without sound because of SimpleTuner's implementation not finished completely at the time of the training.

There is a kink that the objects start subtly compressing before the press hits them, but it's a purely dataset related thing. (synthetic OmniVFX from Huggingface)

I think the CREPA-Prodigy-mix speedup is insane, because in r/StableDiffusion subreddit is was reported that with the official trainer LoRA training takes as much as 9 hours on a 5090 and 1 hour on rented Cloud hardware.

The training and dataset config are under config.json and ltx2-multiresolution-crush-t2v.json respectively.

Prompt
The video begins with a tank. A hydraulic press positioned above slowly descends towards the tank. Upon contact, the hydraulic press c5us4 crushes it, deforming and flattening the tank, causing the tank to collapse inward until the tank is no longer recognizable.
Prompt
The video begins with an anime girl. A hydraulic press positioned above slowly descends towards the anime girl. Upon contact, the hydraulic press c5us4 crushes it, deforming and flattening the anime girl, causing the anime girl to collapse inward until the anime girl is no longer recognizable.
Prompt
The video begins with a tank. A hydraulic press positioned above slowly descends towards the tank. Upon contact, the hydraulic press c5us4 crushes it, deforming and flattening the tank, causing the tank to collapse inward until the tank is no longer recognizable.
Prompt
The video begins with an anime girl. A hydraulic press positioned above slowly descends towards the anime girl. Upon contact, the hydraulic press c5us4 crushes it, deforming and flattening the anime girl, causing the anime girl to collapse inward until the anime girl is no longer recognizable.
Prompt
The video begins with pyramids. A hydraulic press positioned above slowly descends towards the pyramids. Upon contact, the hydraulic press c5us4 crushes it, deforming and flattening the pyramids, causing the pyramids to collapse inward until the pyramids is no longer recognizable.
Downloads last month
20
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kabachuha/ltx2-hydraulic-press

Base model

Lightricks/LTX-2
Adapter
(13)
this model

Dataset used to train kabachuha/ltx2-hydraulic-press

Paper for kabachuha/ltx2-hydraulic-press