---
license: apache-2.0
language:
- en
base_model:
- tencent/HunyuanVideo
- Qwen/Qwen2.5-VL-7B-Instruct
pipeline_tag: any-to-any
tags:
- video
---
**[Cong Wei
*,1,2](https://congwei1230.github.io/) **
**[Quande Liu
β ,2](https://liuquande.github.io/) **
**[Zixuan Ye
2](https://openreview.net/profile?id=~Zixuan_Ye1) **
**[Qiulin Wang
2](https://scholar.google.com/citations?user=3vvZdy8AAAAJ&hl=en) **
**[Xintao Wang
2](https://xinntao.github.io/)**
**[Pengfei Wan
2](https://magicwpf.github.io/) **
**[Kun Gai
2](https://openreview.net/profile?id=~Kun_Gai1) **
**[Wenhu Chen
β ,1](https://wenhuchen.github.io/)**
1University of Waterloo
2Kling Team, Kuaishou Technology
*Work done during an internship at Kling Team, Kuaishou Technology
β Corresponding author