Add pipeline tag, paper link, and citation
#7
by
nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,6 +1,7 @@
|
|
| 1 |
---
|
| 2 |
library_name: diffusers
|
| 3 |
license: apache-2.0
|
|
|
|
| 4 |
tags:
|
| 5 |
- image-to-video
|
| 6 |
- image-text-to-video
|
|
@@ -39,10 +40,11 @@ MOVA addresses the limitations of proprietary systems like Sora 2 and Veo 3 by o
|
|
| 39 |
### Model Sources
|
| 40 |
|
| 41 |
- **Github:** https://github.com/OpenMOSS/MOVA
|
| 42 |
-
- **Paper:**
|
|
|
|
| 43 |
|
| 44 |
### Model Usage
|
| 45 |
-
Please refer to the github page for model usage.
|
| 46 |
|
| 47 |
## Evaluation
|
| 48 |
We evaluate our model through both objective benchmarks and subjective human evaluations. Below are the Elo scores and win rates comparing MOVA to existing open-source models.
|
|
@@ -55,3 +57,13 @@ We evaluate our model through both objective benchmarks and subjective human eva
|
|
| 55 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/64817b8550b759c75d5d1eeb/i5lgZI3NmxLXdJIxndcOp.png" width="1000"/>
|
| 56 |
<p>
|
| 57 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
library_name: diffusers
|
| 3 |
license: apache-2.0
|
| 4 |
+
pipeline_tag: any-to-any
|
| 5 |
tags:
|
| 6 |
- image-to-video
|
| 7 |
- image-text-to-video
|
|
|
|
| 40 |
### Model Sources
|
| 41 |
|
| 42 |
- **Github:** https://github.com/OpenMOSS/MOVA
|
| 43 |
+
- **Paper:** [MOVA: Towards Scalable and Synchronized Video-Audio Generation](https://huggingface.co/papers/2602.08794)
|
| 44 |
+
- **Project Page:** https://mosi.cn/models/mova
|
| 45 |
|
| 46 |
### Model Usage
|
| 47 |
+
Please refer to the [Quick Start](https://github.com/OpenMOSS/MOVA#quick-start) section on the GitHub page for model usage and inference scripts.
|
| 48 |
|
| 49 |
## Evaluation
|
| 50 |
We evaluate our model through both objective benchmarks and subjective human evaluations. Below are the Elo scores and win rates comparing MOVA to existing open-source models.
|
|
|
|
| 57 |
<img src="https://cdn-uploads.huggingface.co/production/uploads/64817b8550b759c75d5d1eeb/i5lgZI3NmxLXdJIxndcOp.png" width="1000"/>
|
| 58 |
<p>
|
| 59 |
|
| 60 |
+
## Citation
|
| 61 |
+
|
| 62 |
+
```bibtex
|
| 63 |
+
@article{yu2026mova,
|
| 64 |
+
title={MOVA: Towards Scalable and Synchronized Video-Audio Generation},
|
| 65 |
+
author={Yu, Donghua and Chen, Mingshu and Chen, Qi and Luo, Qi and Wu, Qianyi and Cheng, Qinyuan and Li, Ruixiao and Liang, Tianyi and Zhang, Wenbo and Tu, Wenming and others},
|
| 66 |
+
journal={arXiv preprint arXiv:2602.08794},
|
| 67 |
+
year={2026}
|
| 68 |
+
}
|
| 69 |
+
```
|