Add Arxiv ID to metadata and improve model card

#17
by nielsr (HF Staff) - opened
Files changed (1)
  1. README.md +28 -15
README.md CHANGED
@@ -1,5 +1,20 @@
  ---
+ language:
+ - en
+ - de
+ - es
+ - fr
+ - ja
+ - ko
+ - zh
+ - it
+ - pt
+ library_name: diffusers
+ license: other
+ license_name: ltx-2-community-license-agreement
+ license_link: https://github.com/Lightricks/LTX-2/blob/main/LICENSE
  pipeline_tag: image-to-video
+ arxiv: 2601.03233
  tags:
  - image-to-video
  - text-to-video
@@ -17,25 +32,12 @@ tags:
  - ltxv
  - lightricks
  pinned: true
- language:
- - en
- - de
- - es
- - fr
- - ja
- - ko
- - zh
- - it
- - pt
- license: other
- license_name: ltx-2-community-license-agreement
- license_link: https://github.com/Lightricks/LTX-2/blob/main/LICENSE
- library_name: diffusers
  demo: https://app.ltx.studio/ltx-2-playground/i2v
  ---

  # LTX-2 Model Card
- This model card focuses on the LTX-2 model, codebase available [here](https://github.com/Lightricks/LTX-2).
+
+ This model card focuses on the LTX-2 model, as presented in the paper [LTX-2: Efficient Joint Audio-Visual Foundation Model](https://huggingface.co/papers/2601.03233). The codebase is available [here](https://github.com/Lightricks/LTX-2).

  LTX-2 is a DiT-based audio-video foundation model designed to generate synchronized video and audio within a single model. It brings together the core building blocks of modern video generation, with open weights and a focus on practical, local execution.

@@ -116,3 +118,14 @@ The base (dev) model is fully trainable.
  It's extremely easy to reproduce the LoRAs and IC-LoRAs we publish with the model by following the instructions on the [LTX-2 Trainer Readme](https://github.com/Lightricks/LTX-2/blob/main/packages/ltx-trainer/README.md).

  Training for motion, style or likeness (sound+appearance) can take less than an hour in many settings.
+
+ ## Citation
+
+ ```bibtex
+ @article{hacohen2025ltx2,
+   title={LTX-2: Efficient Joint Audio-Visual Foundation Model},
+   author={HaCohen, Yoav and Brazowski, Benny and Chiprut, Nisan and Bitterman, Yaki and Kvochko, Andrew and Berkowitz, Avishai and Shalem, Daniel and Lifschitz, Daphna and Moshe, Dudu and Porat, Eitan and Richardson, Eitan and Shiran, Guy and Chachy, Itay and Chetboun, Jonathan and Finkelson, Michael and Kupchick, Michael and Zabari, Nir and Guetta, Nitzan and Kotler, Noa and Bibi, Ofir and Gordon, Ori and Panet, Poriya and Benita, Roi and Armon, Shahar and Kulikov, Victor and Inger, Yaron and Shiftan, Yonatan and Melumian, Zeev and Farbman, Zeev},
+   journal={arXiv preprint arXiv:2601.03233},
+   year={2025}
+ }
+ ```
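Since the PR's metadata declares `library_name: diffusers`, a quick way to sanity-check the card is to try loading the checkpoint through diffusers' generic loader. The sketch below is illustrative only and not part of the PR: the repo id `Lightricks/LTX-2`, the dtype, and the use of `DiffusionPipeline` rather than a dedicated LTX pipeline class are all assumptions.

```python
# Minimal sketch, assuming the checkpoint is loadable via diffusers' generic
# DiffusionPipeline loader. The repo id and dtype are assumptions, not taken
# from the PR; the model may ship a dedicated LTX pipeline class instead.
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained(
    "Lightricks/LTX-2",          # hypothetical repo id for this model card
    torch_dtype=torch.bfloat16,  # assumed precision; check the card's usage notes
)
pipe.to("cuda")

# The trainer section mentions reproducing published LoRAs. If the pipeline
# follows diffusers' standard LoRA API, loading one would look like this
# (the path is hypothetical):
pipe.load_lora_weights("path/to/lora")
```

If the repository exposes its own pipeline class, `DiffusionPipeline.from_pretrained` will usually resolve it from the repo's `model_index.json`, so the generic loader is a reasonable first check that the `library_name` tag is accurate.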