ofirbibi commited on
Commit
91a7190
·
1 Parent(s): 9b6bbe0

Docs: Update version and files

Browse files
Files changed (1) hide show
  1. README.md +32 -14
README.md CHANGED
@@ -1,4 +1,20 @@
1
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  tags:
3
  - image-to-video
4
  - text-to-video
@@ -12,30 +28,32 @@ tags:
12
  - image-to-audio-video
13
  - image-text-to-audio-video
14
  - ltx-2
 
15
  - ltx-video
16
  - ltxv
17
  - lightricks
18
- ---
 
19
 
20
- # LTX-2 Model Card
21
 
22
- This model card focuses on the LTX-2 model, as presented in the paper [LTX-2: Efficient Joint Audio-Visual Foundation Model](https://huggingface.co/papers/2601.03233). The codebase is available [here](https://github.com/Lightricks/LTX-2).
 
23
 
24
- LTX-2 is a DiT-based audio-video foundation model designed to generate synchronized video and audio within a single model. It brings together the core building blocks of modern video generation, with open weights and a focus on practical, local execution.
25
 
26
  [![LTX-2 Open Source](https://img.youtube.com/vi/8fWAJXZJbRA/maxresdefault.jpg)](https://www.youtube.com/watch?v=8fWAJXZJbRA)
27
 
28
  # Model Checkpoints
29
 
30
- | Name | Notes |
31
- |--------------------------------|----------------------------------------------------------------------------------------------------------------|
32
- | ltx-2-19b-dev | The full model, flexible and trainable in bf16 |
33
- | ltx-2-19b-dev-fp8 | The full model in fp8 quantization |
34
- | ltx-2-19b-dev-fp4 | The full model in nvfp4 quantization |
35
- | ltx-2-19b-distilled | The distilled version of the full model, 8 steps, CFG=1 |
36
- | ltx-2-19b-distilled-lora-384 | A LoRA version of the distilled model applicable to the full model |
37
- | ltx-2-spatial-upscaler-x2-1.0 | An x2 spatial upscaler for the ltx-2 latents, used in multi stage (multiscale) pipelines for higher resolution |
38
- | ltx-2-temporal-upscaler-x2-1.0 | An x2 temporal upscaler for the ltx-2 latents, used in multi stage (multiscale) pipelines for higher FPS |
39
 
40
  ## Model Details
41
  - **Developed by:** Lightricks
@@ -43,7 +61,7 @@ LTX-2 is a DiT-based audio-video foundation model designed to generate synchroni
43
  - **Language(s):** English
44
 
45
  # Online demo
46
- LTX-2 is accessible right away via the following links:
47
  - [LTX-Studio text-to-video](https://app.ltx.studio/ltx-2-playground/t2v)
48
  - [LTX-Studio image-to-video](https://app.ltx.studio/ltx-2-playground/i2v)
49
 
 
1
  ---
2
+ language:
3
+ - en
4
+ - de
5
+ - es
6
+ - fr
7
+ - ja
8
+ - ko
9
+ - zh
10
+ - it
11
+ - pt
12
+ library_name: diffusers
13
+ license: other
14
+ license_name: ltx-2-community-license-agreement
15
+ license_link: https://github.com/Lightricks/LTX-2/blob/main/LICENSE
16
+ pipeline_tag: image-to-video
17
+ arxiv: 2601.03233
18
  tags:
19
  - image-to-video
20
  - text-to-video
 
28
  - image-to-audio-video
29
  - image-text-to-audio-video
30
  - ltx-2
31
+ - ltx-2.3
32
  - ltx-video
33
  - ltxv
34
  - lightricks
35
+ pinned: true
36
+ demo: https://app.ltx.studio/ltx-2-playground/i2v---
37
 
38
+ # LTX-2.3 Model Card
39
 
40
+ This model card focuses on the LTX-2.3 model, which is a significant update to the [LTX-2 model](https://huggingface.co/Lightricks/LTX-2) with improved audio and visual quality as well as enhanced prompt adherence.
41
+ LTX-2 was presented in the paper [LTX-2: Efficient Joint Audio-Visual Foundation Model](https://huggingface.co/papers/2601.03233). The codebase is available [here](https://github.com/Lightricks/LTX-2).
42
 
43
+ LTX-2.3 is a DiT-based audio-video foundation model designed to generate synchronized video and audio within a single model. It brings together the core building blocks of modern video generation, with open weights and a focus on practical, local execution.
44
 
45
  [![LTX-2 Open Source](https://img.youtube.com/vi/8fWAJXZJbRA/maxresdefault.jpg)](https://www.youtube.com/watch?v=8fWAJXZJbRA)
46
 
47
  # Model Checkpoints
48
 
49
+ | Name | Notes |
50
+ |------------------------------------|--------------------------------------------------------------------------------------------------------------------|
51
+ | ltx-2.3-20b-dev | The full model, flexible and trainable in bf16 |
52
+ | ltx-2.3-20b-distilled | The distilled version of the full model, 8 steps, CFG=1 |
53
+ | ltx-2.3-20b-distilled-lora-384 | A LoRA version of the distilled model applicable to the full model |
54
+ | ltx-2.3-spatial-upscaler-x2-1.0 | An x2 spatial upscaler for the ltx-2.3 latents, used in multi stage (multiscale) pipelines for higher resolution |
55
+ | ltx-2.3-spatial-upscaler-x1.5-1.0 | An x1.5 spatial upscaler for the ltx-2.3 latents, used in multi stage (multiscale) pipelines for higher resolution |
56
+ | ltx-2.3-temporal-upscaler-x2-1.0 | An x2 temporal upscaler for the ltx-2.3 latents, used in multi stage (multiscale) pipelines for higher FPS |
 
57
 
58
  ## Model Details
59
  - **Developed by:** Lightricks
 
61
  - **Language(s):** English
62
 
63
  # Online demo
64
+ LTX-2.3 is accessible right away via the following links:
65
  - [LTX-Studio text-to-video](https://app.ltx.studio/ltx-2-playground/t2v)
66
  - [LTX-Studio image-to-video](https://app.ltx.studio/ltx-2-playground/i2v)
67