Add model card for AVI-Edit component
#1
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,4 +1,34 @@
|
|
| 1 |
---
|
| 2 |
license: mit
|
|
|
|
| 3 |
---
|
| 4 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: mit
|
| 3 |
+
pipeline_tag: audio-to-audio
|
| 4 |
---
|
| 5 |
+
|
| 6 |
+
# MVSEP for AVI-Edit
|
| 7 |
+
|
| 8 |
+
This repository contains the model weights for cinematic sound demixing used as a component in the **AVI-Edit** framework, as presented in the paper [AVI-Edit: Audio-sync Video Instance Editing with Granularity-Aware Mask Refiner](https://arxiv.org/abs/2512.10571).
|
| 9 |
+
|
| 10 |
+
[**Project Page**](https://hjzheng.net/projects/AVI-Edit/) | [**Github Repository**](https://github.com/suimuc/AVI-Edit-Framework) | [**Paper**](https://arxiv.org/abs/2512.10571)
|
| 11 |
+
|
| 12 |
+
## Description
|
| 13 |
+
AVI-Edit is a framework for audio-synchronous video instance editing. It utilizes a granularity-aware mask refiner to iteratively transform coarse user-provided masks into precise instance-level regions, and a self-feedback audio agent to curate high-quality audio guidance. These specific weights are utilized for the audio demixing tasks required by the framework's processing pipeline.
|
| 14 |
+
|
| 15 |
+
The weights were originally sourced from the [MVSEP-CDX23-Cinematic-Sound-Demixing](https://github.com/ZFTurbo/MVSEP-CDX23-Cinematic-Sound-Demixing) project.
|
| 16 |
+
|
| 17 |
+
## Usage
|
| 18 |
+
These weights are designed to be used within the AVI-Edit framework. You can download them to the expected local directory using the following command:
|
| 19 |
+
|
| 20 |
+
```bash
|
| 21 |
+
hf download suimu/MVSEP --local-dir model_weights/MVSEP
|
| 22 |
+
```
|
| 23 |
+
|
| 24 |
+
For more detailed setup and inference instructions, please refer to the [AVI-Edit-Framework GitHub repository](https://github.com/suimuc/AVI-Edit-Framework).
|
| 25 |
+
|
| 26 |
+
## Citation
|
| 27 |
+
```bibtex
|
| 28 |
+
@article{avi-edit,
|
| 29 |
+
title={Audio-sync Video Instance Editing with Granularity-Aware Mask Refiner},
|
| 30 |
+
author={Zheng, Haojie and Weng, Shuchen and Liu, Jingqi and Yang, Siqi and Shi, Boxin and Wang, Xinlong},
|
| 31 |
+
journal={arXiv preprint arXiv:2512.10571},
|
| 32 |
+
year={2025}
|
| 33 |
+
}
|
| 34 |
+
```
|