Image-to-Video
Diffusers
Safetensors
LTX2Pipeline
text-to-video
video-to-video
image-text-to-video
audio-to-video
text-to-audio
video-to-audio
audio-to-audio
text-to-audio-video
image-to-audio-video
image-text-to-audio-video
ltx-2
ltx-video
ltxv
lightricks
Instructions to use Lightricks/LTX-2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use Lightricks/LTX-2 with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline from diffusers.utils import load_image, export_to_video # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("Lightricks/LTX-2", dtype=torch.bfloat16, device_map="cuda") pipe.to("cuda") prompt = "A man with short gray hair plays a red electric guitar." image = load_image( "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/guitar-man.png" ) output = pipe(image=image, prompt=prompt).frames[0] export_to_video(output, "output.mp4") - Inference
- Notebooks
- Google Colab
- Kaggle
Model is not open source or open weights due to commercial use restrictions. Shared source label is more appropriate.
Browse filesModel is not open source or open weights due to commercial use restrictions. Shared source label is more appropriate. See https://opensource.org/osd for the open source definition.
Calling it open source could confuse users into thinking they can incorporate it into their product freely (at any scale).
README.md
CHANGED
|
@@ -39,9 +39,9 @@ demo: https://app.ltx.studio/ltx-2-playground/i2v
|
|
| 39 |
|
| 40 |
This model card focuses on the LTX-2 model, as presented in the paper [LTX-2: Efficient Joint Audio-Visual Foundation Model](https://huggingface.co/papers/2601.03233). The codebase is available [here](https://github.com/Lightricks/LTX-2).
|
| 41 |
|
| 42 |
-
LTX-2 is a DiT-based audio-video foundation model designed to generate synchronized video and audio within a single model. It brings together the core building blocks of modern video generation, with
|
| 43 |
|
| 44 |
-
[. The codebase is available [here](https://github.com/Lightricks/LTX-2).
|
| 41 |
|
| 42 |
+
LTX-2 is a DiT-based audio-video foundation model designed to generate synchronized video and audio within a single model. It brings together the core building blocks of modern video generation, with shared weights and a focus on practical, local execution.
|
| 43 |
|
| 44 |
+
[](https://www.youtube.com/watch?v=8fWAJXZJbRA)
|
| 45 |
|
| 46 |
# Model Checkpoints
|
| 47 |
|