minimax-remover / README.md
AEmotionStudio's picture
Upload README.md with huggingface_hub
0417404 verified
---
license: cc-by-nc-4.0
base_model: zibojia/minimax-remover
tags:
- video
- inpainting
- object-removal
- diffusers
- safetensors
---
# MiniMax-Remover — Video Object Removal
Mirror of [zibojia/minimax-remover](https://huggingface.co/zibojia/minimax-remover) for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA).
## What is MiniMax-Remover?
MiniMax-Remover is a purpose-built DiT (Diffusion Transformer) model for **video object removal**. Given a video and a binary mask indicating unwanted regions, the model inpaints the masked areas with temporally consistent content.
Key features:
- **81-frame native batch size** — processes up to 81 frames at once for temporal consistency
- **6–12 inference steps** — fast inference with iterative mask dilation
- **Simplified DiT architecture** — lightweight compared to general-purpose video editors (~2.5 GB)
## Files
```
transformer/diffusion_pytorch_model.safetensors (~2.25 GB)
vae/diffusion_pytorch_model.safetensors (~508 MB)
scheduler/scheduler_config.json
```
## Usage
### With ComfyUI-FFMPEGA (recommended)
1. Enable the `use_minimax_remover` toggle on the FFMPEG Agent node
2. Use `auto_mask:effect=remove` or select `minimax_remover` no-LLM mode
3. The model auto-downloads on first use
### Manual download
```bash
huggingface-cli download AEmotionStudio/minimax-remover --local-dir ./minimax_remover
```
### Programmatic
```python
from huggingface_hub import snapshot_download
snapshot_download(
repo_id="AEmotionStudio/minimax-remover",
allow_patterns=["vae/*", "transformer/*", "scheduler/*"],
local_dir="./minimax_remover"
)
```
## Removal Priority in FFMPEGA
When used with ComfyUI-FFMPEGA, MiniMax-Remover has the **highest priority** for object removal:
1. **MiniMax-Remover** (~2.5 GB VRAM) — when `use_minimax_remover=On`
2. **FLUX Klein 4B** (~15 GB VRAM) — when `use_flux_klein=On`
3. **LaMa** (~200 MB VRAM) — always available fallback
4. **Black fill** (0 VRAM) — FFmpeg fallback
## ⚠️ License
> **Non-Commercial Use Only**: Model weights are licensed under **CC-BY-NC-4.0**. The source code is Apache 2.0.
>
> Users must accept the non-commercial license terms when downloading. Commercial use of the model weights requires separate licensing from the original authors.
## Credits
- Original model by: [Jiaqi Zhu et al.](https://github.com/zibojia/MiniMax-Remover)
- Paper: *"MiniMax-Remover: Taming Bad Noise Helps Video Object Removal"*
- Upstream HuggingFace: [zibojia/minimax-remover](https://huggingface.co/zibojia/minimax-remover)
- Redistributed by: [Æmotion Studio](https://huggingface.co/AEmotionStudio) for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA)