Upload 2137833/2418378/README.md with huggingface_hub
Browse files- 2137833/2418378/README.md +140 -4
2137833/2418378/README.md
CHANGED
|
@@ -1,11 +1,147 @@
|
|
| 1 |
-
|
| 2 |
---
|
| 3 |
license: other
|
| 4 |
tags:
|
| 5 |
-
-
|
| 6 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
Author: [SeoulSeeker](https://civitai.com/user/SeoulSeeker)
|
| 8 |
|
| 9 |
-
Model: [
|
|
|
|
|
|
|
| 10 |
|
| 11 |
-
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: other
|
| 3 |
tags:
|
| 4 |
+
- tool
|
| 5 |
---
|
| 6 |
+
|
| 7 |
+
# (NSFW) Dead-Simple MMAudio + RIFE Interpolation Setup for WAN 2.2 I2V 14B - v1.0.1 - 2418378
|
| 8 |
+
|
| 9 |
+
**Model Type**: Workflows
|
| 10 |
+
|
| 11 |
+
**Base Model**: Wan Video 2.2 I2V-A14B
|
| 12 |
+
|
| 13 |
+
**Trigger Words**: None
|
| 14 |
+
|
| 15 |
+
**Tags**: tool
|
| 16 |
+
|
| 17 |
+
## Gallery
|
| 18 |
+
|
| 19 |
+
<table>
|
| 20 |
+
<tr>
|
| 21 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/2d70c3cc-ae0b-4c29-814f-15ccbd702bfd/original=true/110680919.mp4" width="200" controls muted autoplay loop></video></td>
|
| 22 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/1bb32080-5f21-4a02-b0d9-118161840166/original=true/110681031.mp4" width="200" controls muted autoplay loop></video></td>
|
| 23 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/7f59ad48-0d2f-4305-90df-3b74d37ecbf4/original=true/110681469.mp4" width="200" controls muted autoplay loop></video></td>
|
| 24 |
+
</tr>
|
| 25 |
+
<tr>
|
| 26 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/a5836950-54b0-4df8-8971-33201617739f/original=true/110681546.mp4" width="200" controls muted autoplay loop></video></td>
|
| 27 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/119cbc76-070e-4f42-9029-588b27ed402e/original=true/110814229.mp4" width="200" controls muted autoplay loop></video></td>
|
| 28 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/08b9bd8a-476d-46df-bd81-75c9a4cc6b42/original=true/111285113.mp4" width="200" controls muted autoplay loop></video></td>
|
| 29 |
+
</tr>
|
| 30 |
+
<tr>
|
| 31 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/94dd0779-af13-408e-b2ad-0d465599e2cc/original=true/111285242.mp4" width="200" controls muted autoplay loop></video></td>
|
| 32 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/b85e85d7-e037-4723-a064-31b75a81355d/original=true/111285628.mp4" width="200" controls muted autoplay loop></video></td>
|
| 33 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/c710cd5a-4ef4-4ab0-a90b-802674d76ea4/original=true/111285699.mp4" width="200" controls muted autoplay loop></video></td>
|
| 34 |
+
</tr>
|
| 35 |
+
<tr>
|
| 36 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/514d1e68-d9d8-49c8-8e74-3e56a5310546/original=true/111285744.mp4" width="200" controls muted autoplay loop></video></td>
|
| 37 |
+
</tr>
|
| 38 |
+
</table>
|
| 39 |
+
|
| 40 |
+
## Description
|
| 41 |
+
|
| 42 |
+
Changelog
|
| 43 |
+
|
| 44 |
+
**Version 1.0.3**: Connected both steps so no more re-uploading is required. Just upload your video in Step 1 and hit Run.
|
| 45 |
+
|
| 46 |
+
**Version 1.0.2:** Changed VHS nodes to VHS ffmpeg nodes to avoid color drift (thank you LastAssignment). Also changed FPS flow from 24 to 25 to more closely align to MMAudio specs.
|
| 47 |
+
|
| 48 |
+
**Version 1.0.1**: RIFE Group output was set to 8fps by accident. Changed it to 24fps
|
| 49 |
+
|
| 50 |
+
**Version 1.0**: Initial release
|
| 51 |
+
|
| 52 |
+
**A TRIBUTE TO GOONERS EVERYWHERE**
|
| 53 |
+
|
| 54 |
+
### Your WAN 2.2 video is great. It looks awesome. But where's the sound? We moved from images to videos, and WAN 2.2 is incredible for video. The missing piece...AUDIO!
|
| 55 |
+
|
| 56 |
+
This is my first article ever, so I'm sorry if I made any mistakes. Please leave a comment if I've made an error or if you need any help. For your reference, I'm running:
|
| 57 |
+
|
| 58 |
+
* ComfyUI 0.3.68
|
| 59 |
+
* Torch 2.9
|
| 60 |
+
* CUDA 13
|
| 61 |
+
* Python 3.13.9
|
| 62 |
+
* Sage Attention 2.2
|
| 63 |
+
* NVIDIA 5070 Ti (16gb vram)
|
| 64 |
+
|
| 65 |
+
And here are the custom nodes (3 in total):
|
| 66 |
+
|
| 67 |
+
* **ComfyUI-VideoHelperSuite** 1.7.7 (<https://github.com/Kosinkadink/ComfyUI-VideoHelperSuite>)
|
| 68 |
+
* **ComfyUI-MMAudio** Nightly (<https://github.com/kijai/ComfyUI-MMAudio>)
|
| 69 |
+
|
| 70 |
+
+ I recommend manually git cloning this node pack into your /ComfyUI/models/custom\_nodes folder and then installing the requirements.txt file using your embedded python. I'm on portable Comfy, so the command would look something like this:
|
| 71 |
+
|
| 72 |
+
- "C:\ComfyUI\python\_embeded\python.exe" -m pip install -r "C:\ComfyUI\ComfyUI\custom\_nodes\ComfyUI-MMAudio\requirements.txt"
|
| 73 |
+
|
| 74 |
+
* **ComfyUI-VFI** Unknown (<https://github.com/GACLove/ComfyUI-VFI>)
|
| 75 |
+
|
| 76 |
+
+ I think there's a more popular RIFE custom node that a lot of other people use, but Icouldn't figure out how to get fractional multiples for interpolation (16 -> 25fps is a ~1.5x interpolation), but this node allows it.
|
| 77 |
+
|
| 78 |
+
Onto the workflow...
|
| 79 |
+
|
| 80 |
+
------------------------------------
|
| 81 |
+
|
| 82 |
+
This workflow handles two jobs:
|
| 83 |
+
-------------------------------
|
| 84 |
+
|
| 85 |
+
1. Fix WAN 2.2’s native 16fps output by interpolating it to 25fps with RIFE.
|
| 86 |
+
2. Generate synced audio with MMAudio using the final 25fps video.
|
| 87 |
+
|
| 88 |
+
The setup is plug-and-play. Drop in your WAN video → interpolate → feed it into MMAudio → get synced output. The included notes explain the reasoning for FPS, step settings, and seed behavior.
|
| 89 |
+
|
| 90 |
+
What this workflow covers:
|
| 91 |
+
|
| 92 |
+
1. RIFE interpolation from 16 → 25 fps.
|
| 93 |
+
2. MMAudio sampler
|
| 94 |
+
|
| 95 |
+
1. Upon some further testing, 50-100 steps works well. The node runs pretty fast in general, and it's also worthwhile toying with CFG (4.5 - 8). 100 steps and CFG 8 works well for high-quality output and better prompt adherence.
|
| 96 |
+
3. Automatic audio + video combine at 25fps.
|
| 97 |
+
4. Optional re-interpolation afterward if you want 30fps+ output.
|
| 98 |
+
|
| 99 |
+
1. You can plug your finished 25fps video into the 'Step 1: Rife Interpolation' group and just change the 'source\_fps' to 25 and the 'target\_fps' to 30.
|
| 100 |
+
|
| 101 |
+
**Required MMAudio files**
|
| 102 |
+
--------------------------
|
| 103 |
+
|
| 104 |
+
Download all of these into:
|
| 105 |
+
|
| 106 |
+
ComfyUI/models/mmaudio
|
| 107 |
+
|
| 108 |
+
**MMAudio NSFW Model (fine-tuned off the base model)**
|
| 109 |
+
|
| 110 |
+
<https://huggingface.co/phazei/NSFW_MMaudio/resolve/main/mmaudio_large_44k_nsfw_gold_8.5k_final_fp16.safetensors?download=true>
|
| 111 |
+
|
| 112 |
+
**MMAudio VAE (fp16)**
|
| 113 |
+
|
| 114 |
+
<https://huggingface.co/Kijai/MMAudio_safetensors/resolve/5984623e6b436818c6ff287ef6eec93e3e05aa3f/mmaudio_vae_44k_fp16.safetensors>
|
| 115 |
+
|
| 116 |
+
**MMAudio Synchformer (fp16)**
|
| 117 |
+
|
| 118 |
+
<https://huggingface.co/Kijai/MMAudio_safetensors/resolve/main/mmaudio_synchformer_fp16.safetensors>
|
| 119 |
+
|
| 120 |
+
**MMAudio CLIP Encoder (fp16)**
|
| 121 |
+
|
| 122 |
+
<https://huggingface.co/Kijai/MMAudio_safetensors/resolve/main/apple_DFN5B-CLIP-ViT-H-14-384_fp16.safetensors>
|
| 123 |
+
|
| 124 |
+
**Nvidia BigVGAN v2 24KHz 100band 512x**
|
| 125 |
+
|
| 126 |
+
This seems to be required for MMAudio to work. You can manually download all the files, git clone, or use the HuggingFace CLI tool (huggingface-cli repo clone URL). The repo should be placed in the ComfyUI/models/mmaudio folder.
|
| 127 |
+
|
| 128 |
+
<https://huggingface.co/nvidia/bigvgan_v2_44khz_128band_512x>
|
| 129 |
+
|
| 130 |
+
Bonus
|
| 131 |
+
-----
|
| 132 |
+
|
| 133 |
+
Once you've created a good MMAudio track, there are some further steps you can take depending on what you'd like to create.
|
| 134 |
+
|
| 135 |
+
1. Import your audio/video into some type of software (CapCut/Shotcut) and layer on some music in the background. I've done this with a few of my videos. I added a 'radio' filter to make it seem like the music was kinda tinny and playing in the background.
|
| 136 |
+
|
| 137 |
+
2. Layer other audio tracks alongside the NSFW audio track. You can see KaptainSisay very elegantly did something like that here (<https://civitai.com/images/110700679>)
|
| 138 |
+
|
| 139 |
+
---
|
| 140 |
+
|
| 141 |
Author: [SeoulSeeker](https://civitai.com/user/SeoulSeeker)
|
| 142 |
|
| 143 |
+
Model: [CivitAI Model Page](https://civitai.com/models/2137833?modelVersionId=2418378)
|
| 144 |
+
|
| 145 |
+
Archive: [CivArchive Page](https://civarchive.com/models/2137833?modelVersionId=2418378)
|
| 146 |
|
| 147 |
+
<!-- Version: 20260502_upload -->
|