Commit 4d4ea09 · Parent: 3df02d6
Add README and example videos

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- .gitattributes +1 -0
- AnimateDiff_00774.mp4 +3 -0
- AnimateDiff_00777.mp4 +3 -0
- AnimateDiff_00778.mp4 +3 -0
- README.md +57 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.mp4 filter=lfs diff=lfs merge=lfs -text
AnimateDiff_00774.mp4 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6644856f28b9b445cd02a9ba9f297e18cc076a7b32c9a9572ab2925fa01a8bd9
+size 5364557
AnimateDiff_00777.mp4 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e3219f166e7f849e79bb3d3a1c817bc168fdab4cd087598a7c14c75e0542cf13
+size 2537748
AnimateDiff_00778.mp4 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5ecb6edfe62cd8e34e5f9804e8f69081bf8faec4f757a01e82786c39ece91949
+size 4326717
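The video files in this commit are stored via Git LFS, so what the commit actually adds for each is a three-line pointer file (`version`, `oid`, `size`) rather than the video bytes themselves. A minimal sketch of reading such a pointer in Python; the `parse_lfs_pointer` helper is illustrative, not part of any official tooling:

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into a key -> value dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields

pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:6644856f28b9b445cd02a9ba9f297e18cc076a7b32c9a9572ab2925fa01a8bd9
size 5364557
"""

info = parse_lfs_pointer(pointer)
print(info["size"])  # 5364557
```

The `size` field is the byte count of the real file, which is why the commit lists AnimateDiff_00774.mp4 as a 3-line addition even though the video is ~5 MB.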
README.md ADDED
@@ -0,0 +1,57 @@
# LTX-2 Image-to-Video Adapter LoRA

A high-rank LoRA adapter for [LTX-Video 2](https://github.com/Lightricks/LTX-Video) that substantially improves image-to-video generation quality. No complex workflows, no image preprocessing, no compression tricks -- just a direct image-embedding pipeline that works.

## What This Is

This LoRA was trained on **30,000 generated videos** spanning a wide range of subjects, styles, and motion types. The result is a highly generalized adapter that strengthens LTX-2's ability to take a single image and produce coherent, high-fidelity video from it.

### Key Specs

| Parameter | Value |
|-----------|-------|
| **Base Model** | LTX-Video 2 |
| **LoRA Rank** | 256 |
| **Training Set** | ~30,000 generated videos |
| **Training Scope** | Visual only (no explicit audio training) |
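For a sense of what rank 256 means in parameter terms: a rank-r LoRA pair adds r·(d_in + d_out) parameters per adapted weight, versus d_in·d_out for the dense weight it adapts. A quick sketch using a hypothetical 4096-wide projection (LTX-2's actual layer sizes are not documented here):

```python
def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """Parameters in one LoRA pair: A is (rank, d_in), B is (d_out, rank)."""
    return rank * d_in + d_out * rank

# Hypothetical 4096-wide projection -- illustrative only, not LTX-2's real dims.
dense = 4096 * 4096                          # 16,777,216 params in the frozen weight
added = lora_param_count(4096, 4096, 256)    # 2,097,152 params at rank 256
print(f"{added:,} LoRA params = {added / dense:.1%} of the dense layer")
```

At this (assumed) width, rank 256 adds one-eighth of the dense layer's parameter count per adapted matrix, which is why rank 256 counts as high-capacity for a LoRA.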
## What It Does

- **Improved image fidelity** -- the generated video maintains stronger adherence to the source image, with less drift or distortion across frames.
- **Better motion coherence** -- subjects move more naturally and consistently throughout the clip.
- **Broader generalization** -- performs well across diverse subjects and scenes without needing per-category tuning.
- **Zero-workflow overhead** -- no ControlNet, no IP-Adapter stacking, no image manipulation required. Load the LoRA, attach an image embedding, prompt, and generate.

### A Note on Audio

Audio was **not** explicitly trained into this LoRA. However, because of how LTX-2 handles its latent space, there are subtle shifts in audio output compared to the base model. This is a side effect of the training process, not an intentional feature.

## Usage (ComfyUI)

1. Place the LoRA file in your `ComfyUI/models/loras/` directory.
2. Add an **LTX-2** model loader node and load the base LTX-2 checkpoint.
3. Add a **Load LoRA** node and select this adapter.
4. Connect an **image embedding** node with your source image.
5. Add your text prompt and generate.

No additional nodes, preprocessing steps, or auxiliary models are needed.

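For anyone driving ComfyUI through its HTTP API rather than the graph editor, the steps above correspond to a small node graph POSTed as JSON to the `/prompt` endpoint. The sketch below is only an outline: the `class_type` names and file names are assumptions for illustration, and the LTX-2 nodes in your install may be named differently.

```python
# Minimal API-format graph mirroring steps 1-4. Links are ["source_node_id", output_index].
# Node class names ("CheckpointLoaderSimple", "LoraLoaderModelOnly", "LoadImage") and
# both file names are illustrative assumptions, not confirmed LTX-2 node names.
workflow = {
    "1": {"class_type": "CheckpointLoaderSimple",   # step 2: base LTX-2 checkpoint
          "inputs": {"ckpt_name": "ltx-2.safetensors"}},
    "2": {"class_type": "LoraLoaderModelOnly",      # step 3: this adapter
          "inputs": {"model": ["1", 0],
                     "lora_name": "ltx2_i2v_adapter.safetensors",
                     "strength_model": 1.0}},
    "3": {"class_type": "LoadImage",                # step 4: source image
          "inputs": {"image": "source.png"}},
}

# Sanity check: every link references a node that exists in the graph.
for node in workflow.values():
    for value in node["inputs"].values():
        if isinstance(value, list):
            assert value[0] in workflow
```

Prompt conditioning and the sampler (step 5) would hang off the LoRA-patched model output in the same link format.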

## Examples

Three reference videos demonstrating the adapter's output quality:

<video src="https://huggingface.co/MachineDelusions/LTX-2_Image2Video_Adapter_LoRa/resolve/main/AnimateDiff_00774.mp4" autoplay loop muted playsinline></video>

<video src="https://huggingface.co/MachineDelusions/LTX-2_Image2Video_Adapter_LoRa/resolve/main/AnimateDiff_00777.mp4" autoplay loop muted playsinline></video>

<video src="https://huggingface.co/MachineDelusions/LTX-2_Image2Video_Adapter_LoRa/resolve/main/AnimateDiff_00778.mp4" autoplay loop muted playsinline></video>

## Model Details

- **Architecture:** LoRA (Low-Rank Adaptation) applied to LTX-Video 2's transformer layers
- **Rank 256** provides a high-capacity adaptation while remaining efficient to load and merge
- **Training data** was intentionally diverse to avoid overfitting to any single domain, producing a general-purpose image-to-video adapter rather than a style-specific fine-tune
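The "efficient to merge" point comes straight from the LoRA formulation: the adapter's contribution is a low-rank update, W' = W + scale·(B·A), so it can be folded into the base weight with one matmul and one add, after which inference costs nothing extra. A toy NumPy sketch with made-up shapes (the real adapter uses rank 256 on far larger weights):

```python
import numpy as np

rng = np.random.default_rng(0)
d, r = 8, 2                        # toy dims; the real adapter uses rank 256
W = rng.normal(size=(d, d))        # frozen base weight
A = rng.normal(size=(r, d))        # LoRA down-projection
B = rng.normal(size=(d, r))        # LoRA up-projection
scale = 1.0

W_merged = W + scale * (B @ A)     # fold the adapter into the base weight

x = rng.normal(size=d)
# Merged weight matches running base + adapter separately.
assert np.allclose(W_merged @ x, W @ x + scale * (B @ (A @ x)))
```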

## License

Please refer to the [LTX-Video license](https://github.com/Lightricks/LTX-Video) for base model terms.