Sentinel7 commited on
Commit
69b6d7f
·
verified ·
1 Parent(s): f17f65f

Upload 2137833/2496344/README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. 2137833/2496344/README.md +130 -4
2137833/2496344/README.md CHANGED
@@ -1,11 +1,137 @@
1
-
2
  ---
3
  license: other
4
  tags:
5
- - civitai
6
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  Author: [SeoulSeeker](https://civitai.com/user/SeoulSeeker)
8
 
9
- Model: [https://civitai.com/models/2137833?modelVersionId=2496344](https://civitai.com/models/2137833?modelVersionId=2496344)
 
 
10
 
11
- Mirror: [https://civarchive.com/models/2137833?modelVersionId=2496344](https://civarchive.com/models/2137833?modelVersionId=2496344)
 
 
1
  ---
2
  license: other
3
  tags:
4
+ - tool
5
  ---
6
+
7
+ # (NSFW) Dead-Simple MMAudio + RIFE Interpolation Setup for WAN 2.2 I2V 14B - v1.0.3 - 2496344
8
+
9
+ **Model Type**: Workflows
10
+
11
+ **Base Model**: Wan Video 2.2 I2V-A14B
12
+
13
+ **Trigger Words**: None
14
+
15
+ **Tags**: tool
16
+
17
+ ## Gallery
18
+
19
+ <table>
20
+ <tr>
21
+ <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/94c93ff6-aef6-4875-a8f8-a0378577fca2/original=true/113737517.mp4" width="200" controls muted autoplay loop></video></td>
22
+ <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/03fb3741-9a7d-411f-915c-d459c691f3b1/original=true/113581266.mp4" width="200" controls muted autoplay loop></video></td>
23
+ <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/bb2e9298-67f7-46e8-9fab-90b68582bcde/original=true/113581611.mp4" width="200" controls muted autoplay loop></video></td>
24
+ </tr>
25
+ <tr>
26
+ <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/365d761e-1e73-495c-b387-204aef520014/original=true/115784564.mp4" width="200" controls muted autoplay loop></video></td>
27
+ </tr>
28
+ </table>
29
+
30
+ ## Description
31
+
32
+ Changelog
33
+
34
+ **Version 1.0.3**: Connected both steps so no more re-uploading is required. Just upload your video in Step 1 and hit Run.
35
+
36
+ **Version 1.0.2:** Changed VHS nodes to VHS ffmpeg nodes to avoid color drift (thank you LastAssignment). Also changed FPS flow from 24 to 25 to more closely align to MMAudio specs.
37
+
38
+ **Version 1.0.1**: RIFE Group output was set to 8fps by accident. Changed it to 24fps
39
+
40
+ **Version 1.0**: Initial release
41
+
42
+ **A TRIBUTE TO GOONERS EVERYWHERE**
43
+
44
+ ### Your WAN 2.2 video is great. It looks awesome. But where's the sound? We moved from images to videos, and WAN 2.2 is incredible for video. The missing piece...AUDIO!
45
+
46
+ This is my first article ever, so I'm sorry if I made any mistakes. Please leave a comment if I've made an error or if you need any help. For your reference, I'm running:
47
+
48
+ * ComfyUI 0.3.68
49
+ * Torch 2.9
50
+ * CUDA 13
51
+ * Python 3.13.9
52
+ * Sage Attention 2.2
53
+ * NVIDIA 5070 Ti (16gb vram)
54
+
55
+ And here are the custom nodes (3 in total):
56
+
57
+ * **ComfyUI-VideoHelperSuite** 1.7.7 (<https://github.com/Kosinkadink/ComfyUI-VideoHelperSuite>)
58
+ * **ComfyUI-MMAudio** Nightly (<https://github.com/kijai/ComfyUI-MMAudio>)
59
+
60
+ + I recommend manually git cloning this node pack into your /ComfyUI/models/custom\_nodes folder and then installing the requirements.txt file using your embedded python. I'm on portable Comfy, so the command would look something like this:
61
+
62
+ - "C:\ComfyUI\python\_embeded\python.exe" -m pip install -r "C:\ComfyUI\ComfyUI\custom\_nodes\ComfyUI-MMAudio\requirements.txt"
63
+
64
+ * **ComfyUI-VFI** Unknown (<https://github.com/GACLove/ComfyUI-VFI>)
65
+
66
+ + I think there's a more popular RIFE custom node that a lot of other people use, but Icouldn't figure out how to get fractional multiples for interpolation (16 -> 25fps is a ~1.5x interpolation), but this node allows it.
67
+
68
+ Onto the workflow...
69
+
70
+ ------------------------------------
71
+
72
+ This workflow handles two jobs:
73
+ -------------------------------
74
+
75
+ 1. Fix WAN 2.2’s native 16fps output by interpolating it to 25fps with RIFE.
76
+ 2. Generate synced audio with MMAudio using the final 25fps video.
77
+
78
+ The setup is plug-and-play. Drop in your WAN video → interpolate → feed it into MMAudio → get synced output. The included notes explain the reasoning for FPS, step settings, and seed behavior.
79
+
80
+ What this workflow covers:
81
+
82
+ 1. RIFE interpolation from 16 → 25 fps.
83
+ 2. MMAudio sampler
84
+
85
+ 1. Upon some further testing, 50-100 steps works well. The node runs pretty fast in general, and it's also worthwhile toying with CFG (4.5 - 8). 100 steps and CFG 8 works well for high-quality output and better prompt adherence.
86
+ 3. Automatic audio + video combine at 25fps.
87
+ 4. Optional re-interpolation afterward if you want 30fps+ output.
88
+
89
+ 1. You can plug your finished 25fps video into the 'Step 1: Rife Interpolation' group and just change the 'source\_fps' to 25 and the 'target\_fps' to 30.
90
+
91
+ **Required MMAudio files**
92
+ --------------------------
93
+
94
+ Download all of these into:
95
+
96
+ ComfyUI/models/mmaudio
97
+
98
+ **MMAudio NSFW Model (fine-tuned off the base model)**
99
+
100
+ <https://huggingface.co/phazei/NSFW_MMaudio/resolve/main/mmaudio_large_44k_nsfw_gold_8.5k_final_fp16.safetensors?download=true>
101
+
102
+ **MMAudio VAE (fp16)**
103
+
104
+ <https://huggingface.co/Kijai/MMAudio_safetensors/resolve/5984623e6b436818c6ff287ef6eec93e3e05aa3f/mmaudio_vae_44k_fp16.safetensors>
105
+
106
+ **MMAudio Synchformer (fp16)**
107
+
108
+ <https://huggingface.co/Kijai/MMAudio_safetensors/resolve/main/mmaudio_synchformer_fp16.safetensors>
109
+
110
+ **MMAudio CLIP Encoder (fp16)**
111
+
112
+ <https://huggingface.co/Kijai/MMAudio_safetensors/resolve/main/apple_DFN5B-CLIP-ViT-H-14-384_fp16.safetensors>
113
+
114
+ **Nvidia BigVGAN v2 24KHz 100band 512x**
115
+
116
+ This seems to be required for MMAudio to work. You can manually download all the files, git clone, or use the HuggingFace CLI tool (huggingface-cli repo clone URL). The repo should be placed in the ComfyUI/models/mmaudio folder.
117
+
118
+ <https://huggingface.co/nvidia/bigvgan_v2_44khz_128band_512x>
119
+
120
+ Bonus
121
+ -----
122
+
123
+ Once you've created a good MMAudio track, there are some further steps you can take depending on what you'd like to create.
124
+
125
+ 1. Import your audio/video into some type of software (CapCut/Shotcut) and layer on some music in the background. I've done this with a few of my videos. I added a 'radio' filter to make it seem like the music was kinda tinny and playing in the background.
126
+
127
+ 2. Layer other audio tracks alongside the NSFW audio track. You can see KaptainSisay very elegantly did something like that here (<https://civitai.com/images/110700679>)
128
+
129
+ ---
130
+
131
  Author: [SeoulSeeker](https://civitai.com/user/SeoulSeeker)
132
 
133
+ Model: [CivitAI Model Page](https://civitai.com/models/2137833?modelVersionId=2496344)
134
+
135
+ Archive: [CivArchive Page](https://civarchive.com/models/2137833?modelVersionId=2496344)
136
 
137
+ <!-- Version: 20260502_upload -->