Upload 2137833/2418211/README.md with huggingface_hub
Browse files- 2137833/2418211/README.md +131 -4
2137833/2418211/README.md
CHANGED
|
@@ -1,11 +1,138 @@
|
|
| 1 |
-
|
| 2 |
---
|
| 3 |
license: other
|
| 4 |
tags:
|
| 5 |
-
-
|
| 6 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
Author: [SeoulSeeker](https://civitai.com/user/SeoulSeeker)
|
| 8 |
|
| 9 |
-
Model: [
|
|
|
|
|
|
|
| 10 |
|
| 11 |
-
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: other
|
| 3 |
tags:
|
| 4 |
+
- tool
|
| 5 |
---
|
| 6 |
+
|
| 7 |
+
# (NSFW) Dead-Simple MMAudio + RIFE Interpolation Setup for WAN 2.2 I2V 14B - v1.0 - 2418211
|
| 8 |
+
|
| 9 |
+
**Model Type**: Workflows
|
| 10 |
+
|
| 11 |
+
**Base Model**: Wan Video 2.2 I2V-A14B
|
| 12 |
+
|
| 13 |
+
**Trigger Words**: None
|
| 14 |
+
|
| 15 |
+
**Tags**: tool
|
| 16 |
+
|
| 17 |
+
## Gallery
|
| 18 |
+
|
| 19 |
+
<table>
|
| 20 |
+
<tr>
|
| 21 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/5484067a-b02d-4fe0-aa0f-eb0a00e5336f/original=true/110674397.mp4" width="200" controls muted autoplay loop></video></td>
|
| 22 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/4e8caf60-87d9-46c2-8030-1879c67fb978/original=true/110674319.mp4" width="200" controls muted autoplay loop></video></td>
|
| 23 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/eb1b33e1-c864-4341-b861-851e9e8ae4c0/original=true/110674260.mp4" width="200" controls muted autoplay loop></video></td>
|
| 24 |
+
</tr>
|
| 25 |
+
<tr>
|
| 26 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/a7430f7d-bf4d-4297-894e-fc9e67a86b89/original=true/110674366.mp4" width="200" controls muted autoplay loop></video></td>
|
| 27 |
+
<td><video src="https://image.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/b6793911-38a1-453d-96eb-60f4e9333d97/original=true/110674376.mp4" width="200" controls muted autoplay loop></video></td>
|
| 28 |
+
</tr>
|
| 29 |
+
</table>
|
| 30 |
+
|
| 31 |
+
## Description
|
| 32 |
+
|
| 33 |
+
Changelog
|
| 34 |
+
|
| 35 |
+
**Version 1.0.3**: Connected both steps so no more re-uploading is required. Just upload your video in Step 1 and hit Run.
|
| 36 |
+
|
| 37 |
+
**Version 1.0.2:** Changed VHS nodes to VHS ffmpeg nodes to avoid color drift (thank you LastAssignment). Also changed FPS flow from 24 to 25 to more closely align to MMAudio specs.
|
| 38 |
+
|
| 39 |
+
**Version 1.0.1**: RIFE Group output was set to 8fps by accident. Changed it to 24fps
|
| 40 |
+
|
| 41 |
+
**Version 1.0**: Initial release
|
| 42 |
+
|
| 43 |
+
**A TRIBUTE TO GOONERS EVERYWHERE**
|
| 44 |
+
|
| 45 |
+
### Your WAN 2.2 video is great. It looks awesome. But where's the sound? We moved from images to videos, and WAN 2.2 is incredible for video. The missing piece...AUDIO!
|
| 46 |
+
|
| 47 |
+
This is my first article ever, so I'm sorry if I made any mistakes. Please leave a comment if I've made an error or if you need any help. For your reference, I'm running:
|
| 48 |
+
|
| 49 |
+
* ComfyUI 0.3.68
|
| 50 |
+
* Torch 2.9
|
| 51 |
+
* CUDA 13
|
| 52 |
+
* Python 3.13.9
|
| 53 |
+
* Sage Attention 2.2
|
| 54 |
+
* NVIDIA 5070 Ti (16gb vram)
|
| 55 |
+
|
| 56 |
+
And here are the custom nodes (3 in total):
|
| 57 |
+
|
| 58 |
+
* **ComfyUI-VideoHelperSuite** 1.7.7 (<https://github.com/Kosinkadink/ComfyUI-VideoHelperSuite>)
|
| 59 |
+
* **ComfyUI-MMAudio** Nightly (<https://github.com/kijai/ComfyUI-MMAudio>)
|
| 60 |
+
|
| 61 |
+
+ I recommend manually git cloning this node pack into your /ComfyUI/models/custom\_nodes folder and then installing the requirements.txt file using your embedded python. I'm on portable Comfy, so the command would look something like this:
|
| 62 |
+
|
| 63 |
+
- "C:\ComfyUI\python\_embeded\python.exe" -m pip install -r "C:\ComfyUI\ComfyUI\custom\_nodes\ComfyUI-MMAudio\requirements.txt"
|
| 64 |
+
|
| 65 |
+
* **ComfyUI-VFI** Unknown (<https://github.com/GACLove/ComfyUI-VFI>)
|
| 66 |
+
|
| 67 |
+
+ I think there's a more popular RIFE custom node that a lot of other people use, but Icouldn't figure out how to get fractional multiples for interpolation (16 -> 25fps is a ~1.5x interpolation), but this node allows it.
|
| 68 |
+
|
| 69 |
+
Onto the workflow...
|
| 70 |
+
|
| 71 |
+
------------------------------------
|
| 72 |
+
|
| 73 |
+
This workflow handles two jobs:
|
| 74 |
+
-------------------------------
|
| 75 |
+
|
| 76 |
+
1. Fix WAN 2.2’s native 16fps output by interpolating it to 25fps with RIFE.
|
| 77 |
+
2. Generate synced audio with MMAudio using the final 25fps video.
|
| 78 |
+
|
| 79 |
+
The setup is plug-and-play. Drop in your WAN video → interpolate → feed it into MMAudio → get synced output. The included notes explain the reasoning for FPS, step settings, and seed behavior.
|
| 80 |
+
|
| 81 |
+
What this workflow covers:
|
| 82 |
+
|
| 83 |
+
1. RIFE interpolation from 16 → 25 fps.
|
| 84 |
+
2. MMAudio sampler
|
| 85 |
+
|
| 86 |
+
1. Upon some further testing, 50-100 steps works well. The node runs pretty fast in general, and it's also worthwhile toying with CFG (4.5 - 8). 100 steps and CFG 8 works well for high-quality output and better prompt adherence.
|
| 87 |
+
3. Automatic audio + video combine at 25fps.
|
| 88 |
+
4. Optional re-interpolation afterward if you want 30fps+ output.
|
| 89 |
+
|
| 90 |
+
1. You can plug your finished 25fps video into the 'Step 1: Rife Interpolation' group and just change the 'source\_fps' to 25 and the 'target\_fps' to 30.
|
| 91 |
+
|
| 92 |
+
**Required MMAudio files**
|
| 93 |
+
--------------------------
|
| 94 |
+
|
| 95 |
+
Download all of these into:
|
| 96 |
+
|
| 97 |
+
ComfyUI/models/mmaudio
|
| 98 |
+
|
| 99 |
+
**MMAudio NSFW Model (fine-tuned off the base model)**
|
| 100 |
+
|
| 101 |
+
<https://huggingface.co/phazei/NSFW_MMaudio/resolve/main/mmaudio_large_44k_nsfw_gold_8.5k_final_fp16.safetensors?download=true>
|
| 102 |
+
|
| 103 |
+
**MMAudio VAE (fp16)**
|
| 104 |
+
|
| 105 |
+
<https://huggingface.co/Kijai/MMAudio_safetensors/resolve/5984623e6b436818c6ff287ef6eec93e3e05aa3f/mmaudio_vae_44k_fp16.safetensors>
|
| 106 |
+
|
| 107 |
+
**MMAudio Synchformer (fp16)**
|
| 108 |
+
|
| 109 |
+
<https://huggingface.co/Kijai/MMAudio_safetensors/resolve/main/mmaudio_synchformer_fp16.safetensors>
|
| 110 |
+
|
| 111 |
+
**MMAudio CLIP Encoder (fp16)**
|
| 112 |
+
|
| 113 |
+
<https://huggingface.co/Kijai/MMAudio_safetensors/resolve/main/apple_DFN5B-CLIP-ViT-H-14-384_fp16.safetensors>
|
| 114 |
+
|
| 115 |
+
**Nvidia BigVGAN v2 24KHz 100band 512x**
|
| 116 |
+
|
| 117 |
+
This seems to be required for MMAudio to work. You can manually download all the files, git clone, or use the HuggingFace CLI tool (huggingface-cli repo clone URL). The repo should be placed in the ComfyUI/models/mmaudio folder.
|
| 118 |
+
|
| 119 |
+
<https://huggingface.co/nvidia/bigvgan_v2_44khz_128band_512x>
|
| 120 |
+
|
| 121 |
+
Bonus
|
| 122 |
+
-----
|
| 123 |
+
|
| 124 |
+
Once you've created a good MMAudio track, there are some further steps you can take depending on what you'd like to create.
|
| 125 |
+
|
| 126 |
+
1. Import your audio/video into some type of software (CapCut/Shotcut) and layer on some music in the background. I've done this with a few of my videos. I added a 'radio' filter to make it seem like the music was kinda tinny and playing in the background.
|
| 127 |
+
|
| 128 |
+
2. Layer other audio tracks alongside the NSFW audio track. You can see KaptainSisay very elegantly did something like that here (<https://civitai.com/images/110700679>)
|
| 129 |
+
|
| 130 |
+
---
|
| 131 |
+
|
| 132 |
Author: [SeoulSeeker](https://civitai.com/user/SeoulSeeker)
|
| 133 |
|
| 134 |
+
Model: [CivitAI Model Page](https://civitai.com/models/2137833?modelVersionId=2418211)
|
| 135 |
+
|
| 136 |
+
Archive: [CivArchive Page](https://civarchive.com/models/2137833?modelVersionId=2418211)
|
| 137 |
|
| 138 |
+
<!-- Version: 20260502_upload -->
|