Sentinel7
/

wan

Model card Files Files and versions

xet

Sentinel7 commited on 14 days ago

Commit

4048057

verified ·

1 Parent(s): 35aec3f

Upload 2137833/2418378/README.md with huggingface_hub

Browse files

Files changed (1) hide show

2137833/2418378/README.md +140 -4

2137833/2418378/README.md CHANGED Viewed

@@ -1,11 +1,147 @@
 ---
 license: other
 tags:
-- civitai
 ---
 Author: [SeoulSeeker](https://civitai.com/user/SeoulSeeker)
-Model: [https://civitai.com/models/2137833?modelVersionId=2418378](https://civitai.com/models/2137833?modelVersionId=2418378)
-Mirror: [https://civarchive.com/models/2137833?modelVersionId=2418378](https://civarchive.com/models/2137833?modelVersionId=2418378)

 ---
 license: other
 tags:
+- tool
 ---
+# (NSFW) Dead-Simple MMAudio + RIFE Interpolation Setup for WAN 2.2 I2V 14B - v1.0.1 - 2418378
+**Model Type**: Workflows
+**Base Model**: Wan Video 2.2 I2V-A14B
+**Trigger Words**: None
+**Tags**: tool
+## Gallery
+<table>
+  <tr>
+    <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/2d70c3cc-ae0b-4c29-814f-15ccbd702bfd/original=true/110680919.mp4" width="200" controls muted autoplay loop></video></td>
+    <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/1bb32080-5f21-4a02-b0d9-118161840166/original=true/110681031.mp4" width="200" controls muted autoplay loop></video></td>
+    <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/7f59ad48-0d2f-4305-90df-3b74d37ecbf4/original=true/110681469.mp4" width="200" controls muted autoplay loop></video></td>
+  </tr>
+  <tr>
+    <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/a5836950-54b0-4df8-8971-33201617739f/original=true/110681546.mp4" width="200" controls muted autoplay loop></video></td>
+    <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/119cbc76-070e-4f42-9029-588b27ed402e/original=true/110814229.mp4" width="200" controls muted autoplay loop></video></td>
+    <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/08b9bd8a-476d-46df-bd81-75c9a4cc6b42/original=true/111285113.mp4" width="200" controls muted autoplay loop></video></td>
+  </tr>
+  <tr>
+    <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/94dd0779-af13-408e-b2ad-0d465599e2cc/original=true/111285242.mp4" width="200" controls muted autoplay loop></video></td>
+    <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/b85e85d7-e037-4723-a064-31b75a81355d/original=true/111285628.mp4" width="200" controls muted autoplay loop></video></td>
+    <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/c710cd5a-4ef4-4ab0-a90b-802674d76ea4/original=true/111285699.mp4" width="200" controls muted autoplay loop></video></td>
+  </tr>
+  <tr>
+    <td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/514d1e68-d9d8-49c8-8e74-3e56a5310546/original=true/111285744.mp4" width="200" controls muted autoplay loop></video></td>
+  </tr>
+</table>
+## Description
+Changelog
+**Version 1.0.3**: Connected both steps so no more re-uploading is required. Just upload your video in Step 1 and hit Run.
+**Version 1.0.2:** Changed VHS nodes to VHS ffmpeg nodes to avoid color drift (thank you LastAssignment). Also changed FPS flow from 24 to 25 to more closely align to MMAudio specs.
+**Version 1.0.1**: RIFE Group output was set to 8fps by accident. Changed it to 24fps
+**Version 1.0**: Initial release
+**A TRIBUTE TO GOONERS EVERYWHERE**
+### Your WAN 2.2 video is great. It looks awesome. But where's the sound? We moved from images to videos, and WAN 2.2 is incredible for video. The missing piece...AUDIO!
+This is my first article ever, so I'm sorry if I made any mistakes. Please leave a comment if I've made an error or if you need any help. For your reference, I'm running:
+* ComfyUI 0.3.68
+* Torch 2.9
+* CUDA 13
+* Python 3.13.9
+* Sage Attention 2.2
+* NVIDIA 5070 Ti (16gb vram)
+And here are the custom nodes (3 in total):
+* **ComfyUI-VideoHelperSuite** 1.7.7 (<https://github.com/Kosinkadink/ComfyUI-VideoHelperSuite>)
+* **ComfyUI-MMAudio** Nightly (<https://github.com/kijai/ComfyUI-MMAudio>)
+  + I recommend manually git cloning this node pack into your /ComfyUI/models/custom\_nodes folder and then installing the requirements.txt file using your embedded python. I'm on portable Comfy, so the command would look something like this:
+    - "C:\ComfyUI\python\_embeded\python.exe" -m pip install -r "C:\ComfyUI\ComfyUI\custom\_nodes\ComfyUI-MMAudio\requirements.txt"
+* **ComfyUI-VFI** Unknown (<https://github.com/GACLove/ComfyUI-VFI>)
+  + I think there's a more popular RIFE custom node that a lot of other people use, but Icouldn't figure out how to get fractional multiples for interpolation (16 -> 25fps is a ~1.5x interpolation), but this node allows it.
+Onto the workflow...
+------------------------------------
+This workflow handles two jobs:
+-------------------------------
+1. Fix WAN 2.2’s native 16fps output by interpolating it to 25fps with RIFE.
+2. Generate synced audio with MMAudio using the final 25fps video.
+The setup is plug-and-play. Drop in your WAN video → interpolate → feed it into MMAudio → get synced output. The included notes explain the reasoning for FPS, step settings, and seed behavior.
+What this workflow covers:
+1. RIFE interpolation from 16 → 25 fps.
+2. MMAudio sampler
+   1. Upon some further testing, 50-100 steps works well. The node runs pretty fast in general, and it's also worthwhile toying with CFG (4.5 - 8). 100 steps and CFG 8 works well for high-quality output and better prompt adherence.
+3. Automatic audio + video combine at 25fps.
+4. Optional re-interpolation afterward if you want 30fps+ output.
+   1. You can plug your finished 25fps video into the 'Step 1: Rife Interpolation' group and just change the 'source\_fps' to 25 and the 'target\_fps' to 30.
+**Required MMAudio files**
+--------------------------
+Download all of these into:
+ComfyUI/models/mmaudio
+**MMAudio NSFW Model (fine-tuned off the base model)**
+<https://huggingface.co/phazei/NSFW_MMaudio/resolve/main/mmaudio_large_44k_nsfw_gold_8.5k_final_fp16.safetensors?download=true>
+**MMAudio VAE (fp16)**
+<https://huggingface.co/Kijai/MMAudio_safetensors/resolve/5984623e6b436818c6ff287ef6eec93e3e05aa3f/mmaudio_vae_44k_fp16.safetensors>
+**MMAudio Synchformer (fp16)**
+<https://huggingface.co/Kijai/MMAudio_safetensors/resolve/main/mmaudio_synchformer_fp16.safetensors>
+**MMAudio CLIP Encoder (fp16)**
+<https://huggingface.co/Kijai/MMAudio_safetensors/resolve/main/apple_DFN5B-CLIP-ViT-H-14-384_fp16.safetensors>
+**Nvidia BigVGAN v2 24KHz 100band 512x**
+This seems to be required for MMAudio to work. You can manually download all the files, git clone, or use the HuggingFace CLI tool (huggingface-cli repo clone URL). The repo should be placed in the ComfyUI/models/mmaudio folder.
+<https://huggingface.co/nvidia/bigvgan_v2_44khz_128band_512x>
+Bonus
+-----
+Once you've created a good MMAudio track, there are some further steps you can take depending on what you'd like to create.
+1. Import your audio/video into some type of software (CapCut/Shotcut) and layer on some music in the background. I've done this with a few of my videos. I added a 'radio' filter to make it seem like the music was kinda tinny and playing in the background.
+2. Layer other audio tracks alongside the NSFW audio track. You can see KaptainSisay very elegantly did something like that here (<https://civitai.com/images/110700679>)
+---
 Author: [SeoulSeeker](https://civitai.com/user/SeoulSeeker)
+Model: [CivitAI Model Page](https://civitai.com/models/2137833?modelVersionId=2418378)
+Archive: [CivArchive Page](https://civarchive.com/models/2137833?modelVersionId=2418378)
+<!-- Version: 20260502_upload -->