File size: 4,441 Bytes
1f92e59
6f1adc3
 
 
 
 
 
 
1f92e59
 
6f1adc3
 
 
 
 
 
a046472
 
 
6f1adc3
 
 
 
 
a046472
6f1adc3
 
 
 
 
 
b5b1160
a046472
 
 
5011dc5
a046472
 
 
 
 
6f1adc3
5011dc5
 
 
 
 
 
 
 
bc09321
6f1adc3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
02929b5
 
 
 
 
 
 
 
6f1adc3
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
---
license: apache-2.0
tags:
- univa-agent
- video-generation
- generative-ai
- agent
- multimodal
---

# ๐Ÿš€ UniVA: Universal Video Agents towards Next-Generation Video Intelligence

**univa-agent** is a revolutionary video agent designed to provide an unprecedented interactive experience and high-quality video generation capabilities.

Our goal is not just to release a model, but to build a unified and powerful video creation platform.

## ๐Ÿ”— Project Ecosystem
`univa-agent` is a complete open-source project, including the following resources:

<p align="center">
    <a href="https://univa.online" target="_blank">
        <img src="https://img.shields.io/badge/๐Ÿ -Project_Page-orange.svg" alt="Project Page">
    </a>
    <a href="https://ngrok-univa.chrisprox599.workers.dev/" target="_blank">
        <img src="https://img.shields.io/badge/๐Ÿš€-Demo-rebeccapurple.svg" alt="Demo">
    </a>
    <a href="https://univa.online/asserts/pdf/UniVA_ICLR2025_v5.pdf" target="_blank">
        <img src="https://img.shields.io/badge/๐Ÿ“„-Paper-brightgreen.svg" alt="Paper">
    </a>
    <a href="https://github.com/univa-agent" target="_blank">
        <img src="https://img.shields.io/badge/๐Ÿ’ป-Code-blue.svg" alt="Code">
    </a><br>
    <a href="https://huggingface.co/datasets/UniVA-Agent/UniVA-Bench" target="_blank">
ย  ย  ย  ย  <img src="https://img.shields.io/badge/๐Ÿ“Š-Benchmark-yellow.svg" alt="Benchmark">
ย  ย  </a>
    <a href="https://huggingface.co/spaces/UniVA-Agent/UniVA-Leaderboard" target="_blank">
ย  ย  ย  ย  <img src="https://img.shields.io/badge/๐Ÿ†-Leaderboard-gold.svg" alt="Leaderboard">
ย  ย  </a>
    <a href="https://huggingface.co/spaces/UniVA-Agent/README/discussions" target="_blank">
ย  ย  ย  ย  <img src="https://img.shields.io/badge/๐Ÿ’ฌ-Discussions-blueviolet.svg" alt="Discussions">
ย  ย  </a>
</p>

## ๐Ÿš€ Try the Demo (Invitation-Only)

We provide an online Demo to quickly experience the powerful features of `univa-agent`.

Please note: **Demo access is currently by [Invitation-Only]**. We are committed to providing a stable, high-quality experience for our initial users.

* **โžก๏ธ [Click here to access the Demo](https://ngrok-univa.chrisprox599.workers.dev/)**
* **โžก๏ธ [Apply for Beta Access](https://forms.gle/X2dJSgfMy7WRJfAj7)**

## Core Features

The design of `univa-agent` is built on two core pillars, aiming to simultaneously address both the **generation quality** and the **creative experience** of video content.

### ๐ŸŒŸ Pillar 1: Unprecedented Interactive Experience

We believe the future of video creation should be interactive and intelligent. `univa-agent` introduces:

* **Unified Interaction System:** Handle multiple video tasks within a single, unified framework without needing to switch tools.
* **Agent with Memory:** Capable of understanding multi-turn conversational context to perform complex, stateful video editing and creation.
* **Deep Interaction Capabilities:** Supports fine-grained instructions, enabling comprehensive control from high-level concepts down to specific details.

### ๐ŸŽจ Pillar 2: High-Quality Generation Capabilities

A powerful agent must be matched with high-quality execution. `univa-agent` ensures:

* **Broad Task Support (Breadth):** Covers a wide range of functions, from text-to-video generation, video editing, and style transfer to video inpainting.
* **High-Fidelity Video Output:** Generated video content achieves state-of-the-art results in clarity, coherence, and visual aesthetics.
* **Powerful Function Map:** [Briefly describe 1-2 unique functional modules mentioned in your meeting, e.g., "Synergistic Components" or "Architecture Highlights"].


## ๐Ÿ‘ฅ Team

This project is developed by the UniVA team.
For detailed team member introductions, please visit our [Team Page](https://univa.online/#authors-section).

## โœ๏ธ How to Cite

If you find our work helpful for your research, please consider citing our paper:

```bibtex
@misc{liang2025univauniversalvideoagent,
      title={UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist}, 
      author={Zhengyang Liang and Daoan Zhang and Huichi Zhou and Rui Huang and Bobo Li and Yuechen Zhang and Shengqiong Wu and Xiaohan Wang and Jiebo Luo and Lizi Liao and Hao Fei},
      year={2025},
      eprint={2511.08521},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2511.08521}, 
}