AEmotionStudio
/

minimax-remover

Model card Files Files and versions

minimax-remover / README.md

AEmotionStudio's picture

Upload README.md with huggingface_hub

0417404 verified 3 days ago

|

history blame contribute delete

2.78 kB

	---
	license: cc-by-nc-4.0
	base_model: zibojia/minimax-remover
	tags:
	- video
	- inpainting
	- object-removal
	- diffusers
	- safetensors
	---

	# MiniMax-Remover — Video Object Removal

	Mirror of [zibojia/minimax-remover](https://huggingface.co/zibojia/minimax-remover) for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA).

	## What is MiniMax-Remover?

	MiniMax-Remover is a purpose-built DiT (Diffusion Transformer) model for video object removal. Given a video and a binary mask indicating unwanted regions, the model inpaints the masked areas with temporally consistent content.

	Key features:
	- 81-frame native batch size — processes up to 81 frames at once for temporal consistency
	- 6–12 inference steps — fast inference with iterative mask dilation
	- Simplified DiT architecture — lightweight compared to general-purpose video editors (~2.5 GB)

	## Files

	```
	transformer/diffusion_pytorch_model.safetensors (~2.25 GB)
	vae/diffusion_pytorch_model.safetensors (~508 MB)
	scheduler/scheduler_config.json
	```

	## Usage

	### With ComfyUI-FFMPEGA (recommended)

	1. Enable the `use_minimax_remover` toggle on the FFMPEG Agent node
	2. Use `auto_mask:effect=remove` or select `minimax_remover` no-LLM mode
	3. The model auto-downloads on first use

	### Manual download

	```bash
	huggingface-cli download AEmotionStudio/minimax-remover --local-dir ./minimax_remover
	```

	### Programmatic

	```python
	from huggingface_hub import snapshot_download

	snapshot_download(
	repo_id="AEmotionStudio/minimax-remover",
	allow_patterns=["vae/", "transformer/", "scheduler/*"],
	local_dir="./minimax_remover"
	)
	```

	## Removal Priority in FFMPEGA

	When used with ComfyUI-FFMPEGA, MiniMax-Remover has the highest priority for object removal:

	1. MiniMax-Remover (~2.5 GB VRAM) — when `use_minimax_remover=On`
	2. FLUX Klein 4B (~15 GB VRAM) — when `use_flux_klein=On`
	3. LaMa (~200 MB VRAM) — always available fallback
	4. Black fill (0 VRAM) — FFmpeg fallback

	## ⚠️ License

	> Non-Commercial Use Only: Model weights are licensed under CC-BY-NC-4.0. The source code is Apache 2.0.
	>
	> Users must accept the non-commercial license terms when downloading. Commercial use of the model weights requires separate licensing from the original authors.

	## Credits

	- Original model by: [Jiaqi Zhu et al.](https://github.com/zibojia/MiniMax-Remover)
	- Paper: "MiniMax-Remover: Taming Bad Noise Helps Video Object Removal"
	- Upstream HuggingFace: [zibojia/minimax-remover](https://huggingface.co/zibojia/minimax-remover)
	- Redistributed by: [Æmotion Studio](https://huggingface.co/AEmotionStudio) for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA)