Lightricks
/

LTX-2.3

@@ -1,4 +1,20 @@
 ---
 tags:
 - image-to-video
 - text-to-video
@@ -12,30 +28,32 @@ tags:
 - image-to-audio-video
 - image-text-to-audio-video
 - ltx-2
 - ltx-video
 - ltxv
 - lightricks
----
-# LTX-2 Model Card
-This model card focuses on the LTX-2 model, as presented in the paper [LTX-2: Efficient Joint Audio-Visual Foundation Model](https://huggingface.co/papers/2601.03233). The codebase is available [here](https://github.com/Lightricks/LTX-2).
-LTX-2 is a DiT-based audio-video foundation model designed to generate synchronized video and audio within a single model. It brings together the core building blocks of modern video generation, with open weights and a focus on practical, local execution.
 [![LTX-2 Open Source](https://img.youtube.com/vi/8fWAJXZJbRA/maxresdefault.jpg)](https://www.youtube.com/watch?v=8fWAJXZJbRA)
 # Model Checkpoints
-| Name                           | Notes                                                                                                          |
-|--------------------------------|----------------------------------------------------------------------------------------------------------------|
-| ltx-2-19b-dev                  | The full model, flexible and trainable in bf16                                                                 |
-| ltx-2-19b-dev-fp8              | The full model in fp8 quantization                                                                             |
-| ltx-2-19b-dev-fp4              | The full model in nvfp4 quantization                                                                           |
-| ltx-2-19b-distilled            | The distilled version of the full model, 8 steps, CFG=1                                                        |
-| ltx-2-19b-distilled-lora-384   | A LoRA version of the distilled model applicable to the full model                                             |
-| ltx-2-spatial-upscaler-x2-1.0  | An x2 spatial upscaler for the ltx-2 latents, used in multi stage (multiscale) pipelines for higher resolution |
-| ltx-2-temporal-upscaler-x2-1.0 | An x2 temporal upscaler for the ltx-2 latents, used in multi stage (multiscale) pipelines for higher FPS       |
 ## Model Details
 - **Developed by:** Lightricks
@@ -43,7 +61,7 @@ LTX-2 is a DiT-based audio-video foundation model designed to generate synchroni
 - **Language(s):** English
 # Online demo
-LTX-2 is accessible right away via the following links:
 - [LTX-Studio text-to-video](https://app.ltx.studio/ltx-2-playground/t2v)
 - [LTX-Studio image-to-video](https://app.ltx.studio/ltx-2-playground/i2v)

 ---
+language:
+- en
+- de
+- es
+- fr
+- ja
+- ko
+- zh
+- it
+- pt
+library_name: diffusers
+license: other
+license_name: ltx-2-community-license-agreement
+license_link: https://github.com/Lightricks/LTX-2/blob/main/LICENSE
+pipeline_tag: image-to-video
+arxiv: 2601.03233
 tags:
 - image-to-video
 - text-to-video
 - image-to-audio-video
 - image-text-to-audio-video
 - ltx-2
+- ltx-2.3
 - ltx-video
 - ltxv
 - lightricks
+pinned: true
+demo: https://app.ltx.studio/ltx-2-playground/i2v---
+# LTX-2.3 Model Card
+This model card focuses on the LTX-2.3 model, which is a significant update to the [LTX-2 model](https://huggingface.co/Lightricks/LTX-2) with improved audio and visual quality as well as enhanced prompt adherence.
+LTX-2 was presented in the paper [LTX-2: Efficient Joint Audio-Visual Foundation Model](https://huggingface.co/papers/2601.03233). The codebase is available [here](https://github.com/Lightricks/LTX-2).
+LTX-2.3 is a DiT-based audio-video foundation model designed to generate synchronized video and audio within a single model. It brings together the core building blocks of modern video generation, with open weights and a focus on practical, local execution.
 [![LTX-2 Open Source](https://img.youtube.com/vi/8fWAJXZJbRA/maxresdefault.jpg)](https://www.youtube.com/watch?v=8fWAJXZJbRA)
 # Model Checkpoints
+| Name                               | Notes                                                                                                              |
+|------------------------------------|--------------------------------------------------------------------------------------------------------------------|
+| ltx-2.3-20b-dev                    | The full model, flexible and trainable in bf16                                                                     |
+| ltx-2.3-20b-distilled              | The distilled version of the full model, 8 steps, CFG=1                                                            |
+| ltx-2.3-20b-distilled-lora-384     | A LoRA version of the distilled model applicable to the full model                                                 |
+| ltx-2.3-spatial-upscaler-x2-1.0    | An x2 spatial upscaler for the ltx-2.3 latents, used in multi stage (multiscale) pipelines for higher resolution   |
+| ltx-2.3-spatial-upscaler-x1.5-1.0  | An x1.5 spatial upscaler for the ltx-2.3 latents, used in multi stage (multiscale) pipelines for higher resolution |
+| ltx-2.3-temporal-upscaler-x2-1.0   | An x2 temporal upscaler for the ltx-2.3 latents, used in multi stage (multiscale) pipelines for higher FPS         |
 ## Model Details
 - **Developed by:** Lightricks
 - **Language(s):** English
 # Online demo
+LTX-2.3 is accessible right away via the following links:
 - [LTX-Studio text-to-video](https://app.ltx.studio/ltx-2-playground/t2v)
 - [LTX-Studio image-to-video](https://app.ltx.studio/ltx-2-playground/i2v)