---
license: apache-2.0
language:
- en
- zh
tags:
- image-to-video
- lora
- replicate
- text-to-video
- video
- video-generation
base_model: "Wan-AI/Wan2.1-T2V-14B-Diffusers"
pipeline_tag: text-to-video
# widget:
#   - text: >-
#       prompt
#     output:
#       url: https://...
instance_prompt: woodward
---

# Thinking Out Loud

<Gallery />

## About this LoRA

This is a [LoRA](https://replicate.com/docs/guides/working-with-loras) for the Wan 14B Text-to-Video model.

It can be used with Diffusers or ComfyUI, and can be loaded onto the Wan 14B text-to-video models.

It was trained on [Replicate](https://replicate.com/) for 500 steps at a learning rate of 5e-05 with a LoRA rank of 32.


## Trigger word

You should use `woodward` to trigger the video generation.
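The trigger word typically goes at the start of a longer prompt. A minimal sketch (the scene description here is illustrative; only the `woodward` token comes from training):

```py
# Illustrative prompt; everything after the trigger word is an example
prompt = "woodward, a person pacing a room and thinking out loud, cinematic lighting"
```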


## Use this LoRA

Replicate has a collection of Wan models that are optimised for speed and cost. They can also be used with this LoRA:

- https://replicate.com/collections/wan-video
- https://replicate.com/fofr/wan-with-lora

### Run this LoRA with an API using Replicate

```py
import replicate

input = {
    "prompt": "woodward",
    # select the 14B variant of fofr/wan-with-lora
    "model": "14b",
    "lora_url": "https://huggingface.co/hanani/Thinking-Out-Loud/resolve/main/wan-14b-t2v-woodward-lora.safetensors"
}

output = replicate.run(
    "fofr/wan-with-lora:latest",
    input=input
)
for index, item in enumerate(output):
    with open(f"output_{index}.mp4", "wb") as file:
        file.write(item.read())
```
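The `replicate` client authenticates via the `REPLICATE_API_TOKEN` environment variable, so set that before running the script. A minimal sketch (the token value is a placeholder, not a real token):

```py
import os

# Placeholder token; use your own from your Replicate account settings
os.environ["REPLICATE_API_TOKEN"] = "r8_..."
```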

### Using with Diffusers

```py
import torch
from diffusers import AutoencoderKLWan, WanPipeline
from diffusers.utils import export_to_video

# Load the base model (the Wan VAE is kept in float32 for numerical stability)
model_id = "Wan-AI/Wan2.1-T2V-14B-Diffusers"
vae = AutoencoderKLWan.from_pretrained(model_id, subfolder="vae", torch_dtype=torch.float32)
pipe = WanPipeline.from_pretrained(model_id, vae=vae, torch_dtype=torch.bfloat16)
pipe.to("cuda")

# Load and apply the LoRA weights from this repository
pipe.load_lora_weights(
    "hanani/Thinking-Out-Loud",
    weight_name="wan-14b-t2v-woodward-lora.safetensors",
)

# Generate video frames (num_frames should be of the form 4k + 1, e.g. 33 or 81)
prompt = "woodward"
negative_prompt = "blurry, low quality, low resolution"

frames = pipe(
    prompt=prompt,
    negative_prompt=negative_prompt,
    num_inference_steps=30,
    guidance_scale=5.0,
    width=832,
    height=480,
    num_frames=33,
).frames[0]

# Save as video (fps is set at export time, not in the pipeline call)
video_path = "output.mp4"
export_to_video(frames, video_path, fps=16)
print(f"Video saved to: {video_path}")
```
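The 14B model is large, so if it does not fit in GPU memory you can use Diffusers' standard offloading helper in place of `pipe.to("cuda")`; it moves submodules to the CPU when they are idle:

```py
# Trade generation speed for lower VRAM usage
pipe.enable_model_cpu_offload()
```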


## Training details

- Steps: 500
- Learning rate: 5e-05
- LoRA rank: 32


## Contribute your own examples

You can use the [community tab](https://huggingface.co/hanani/Thinking-Out-Loud/discussions) to add videos that show off what you've made with this LoRA.