- en
license: other
license_name: ltx-2-community-license
license_link: https://github.com/Lightricks/LTX-2/blob/main/LICENSE
pipeline_tag: any-to-any
tags:
- ltx-video
# LTX-2 19B IC-LoRA Union Control

This is a unified control IC-LoRA trained on top of **LTX-2-19b**, enabling multiple control signals to be used for video generation from text and reference frames.
It was trained with reference latents downscaled by a factor of 2.

It is based on the [LTX-2](https://huggingface.co/papers/2601.03233) foundation model.
## What is In-Context LoRA (IC LoRA)?

IC LoRA enables conditioning video generation on reference video frames at inference time, allowing fine-grained video-to-video control on top of a text-to-video base model.
It also allows using an initial image for image-to-video generation, as well as producing audio-visual output.

## What is the Reference Downscale Factor?

IC LoRA uses a reference control signal, i.e. a video that is positionally aligned with the generated video and contains the reference context.
For added efficiency, the reference video can be smaller, so it consumes fewer tokens.
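To make the token savings concrete, here is a small arithmetic sketch. The patch size and resolutions below are illustrative assumptions, not LTX-2's actual tokenizer parameters:

```python
# Illustrative token arithmetic for a downscaled reference video.
# Patch size and resolutions are assumptions for this sketch only.

def latent_tokens(height, width, frames, patch=32):
    """Rough per-video token count: one token per spatial patch per frame."""
    return (height // patch) * (width // patch) * frames

out_h, out_w, frames = 1024, 1536, 97
factor = 2  # Reference Downscale Factor

output_tokens = latent_tokens(out_h, out_w, frames)
reference_tokens = latent_tokens(out_h // factor, out_w // factor, frames)

print(output_tokens, reference_tokens)  # 148992 37248
```

Halving both spatial dimensions leaves the reference with roughly a quarter of the output's token count.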
- **Base Model:** LTX-2-19b Video
- **Training Type:** IC LoRA
- **Control Type:** Union conditioning - Canny + Depth + Pose
- **Reference Downscale Factor:** 2 (reference resolution is 0.5x the output resolution)
### 🔌 Using in ComfyUI
1. Copy the LoRA weights into `models/loras`.
2. Use the official IC-LoRA workflow from the [LTX-2 ComfyUI repository](https://github.com/Lightricks/ComfyUI-LTXVideo/).
3. Make sure to use the nodes that support the Reference Downscale Factor: `LTXICLoRALoaderModelOnly` to load the LoRA and extract the downscale factor, and `LTXAddVideoICLoRAGuide` to add the downscaled reference latent as a guide.
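Outside ComfyUI, the downscaling step itself is simple to sketch. Here nearest-neighbor subsampling stands in for the proper resampling a real pipeline would use, and the `(frames, height, width, channels)` layout is an assumption:

```python
import numpy as np

def downscale_reference(video: np.ndarray, factor: int = 2) -> np.ndarray:
    """Nearest-neighbor spatial downscale of a (frames, height, width, channels) video.

    A stand-in for what the Reference Downscale Factor nodes do internally;
    real pipelines would use area or bilinear filtering.
    """
    return video[:, ::factor, ::factor, :]

ref = np.zeros((33, 512, 768, 3), dtype=np.uint8)
small = downscale_reference(ref, factor=2)
print(small.shape)  # (33, 256, 384, 3)
```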
## Dataset
  journal={arXiv preprint arXiv:2601.03233},
  year={2025}
}

@misc{LTXVideoTrainer2025,
  title={LTX-Video Community Trainer},
  author={Matan Ben Yosef and Naomi Ken Korem and Tavi Halperin},
  year={2025},
  publisher={GitHub},
}
```

## Acknowledgments