XiangpengYang committed on
Commit 1d7168b · 1 Parent(s): 1f3a26b
README.md CHANGED
@@ -1,10 +1,37 @@
1
- # VideoGrain: Modulating Space-Time Attention for Multi-Grained Video Editing (ICLR 2025)
2
  ## [<a href="https://knightyxp.github.io/VideoGrain_project_page/" target="_blank">Project Page</a>]
3
 
4
  [![arXiv](https://img.shields.io/badge/arXiv-2502.17258-B31B1B.svg)](https://arxiv.org/abs/2502.17258)
5
  [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/papers/2502.17258)
6
  [![Project page](https://img.shields.io/badge/Project-Page-brightgreen)](https://knightyxp.github.io/VideoGrain_project_page/)
7
 
8
  ## ▶️ Setup Environment
9
  Our method is tested with CUDA 12.1, accelerator fp16 precision, and xformers on a single L40 GPU.
10
 
 
1
+ <div align="center">Multi-grained Video Editing
2
+ <h2>
3
+ <font color="red"> VideoGrain: </font> <br>
4
+ Modulating Space-Time Attention for Multi-Grained Video Editing (ICLR 2025)
5
+ </h2>
6
+ </div>
7
  ## [<a href="https://knightyxp.github.io/VideoGrain_project_page/" target="_blank">Project Page</a>]
8
 
9
  [![arXiv](https://img.shields.io/badge/arXiv-2502.17258-B31B1B.svg)](https://arxiv.org/abs/2502.17258)
10
  [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/papers/2502.17258)
11
  [![Project page](https://img.shields.io/badge/Project-Page-brightgreen)](https://knightyxp.github.io/VideoGrain_project_page/)
12
 
13
+
14
+ <table class="center">
15
+ <tr><td><img src="assets/teaser/multi-grain-demo.gif"></td>
16
+ <td><img src="assets/teaser/2monkeys.gif"></td></tr>
17
+ <tr>
18
+ <td width=16% style="text-align:center;">Multi-Grained Video Editing</td>
19
+ <td width=16% style="text-align:center;">"Class Level: human class → spiderman"</td>
20
+ <td width=16% style="text-align:center;">"Instance Level: left → Spiderman, right → Polar Bear"</td>
21
+ <td width=16% style="text-align:center;">"Part Level: Polar Bear + Sunglasses"</td>
22
+ <td width=20% style="text-align:center;">"left → teddy bear, right → golden retriever"</td>
23
+ </tr>
24
+ <tr><td><img src="assets/teaser/2cats.gif"></td>
25
+ <td><img src="assets/teaser/soap-box.gif"></td>
26
+ <td><img src="assets/teaser/man"></td></tr>
27
+ <tr>
28
+ <td width=25% style="text-align:center;">"left cat→ Samoyed, right cat→ Tiger"</td>
29
+ <td width=25% style="text-align:center;">"behind→ Iron Man, front→ Stormtrooper"</td>
30
+ <td width=25% style="text-align:center;">"half-sleeve gray shirt→ a black suit"</td>
31
+ </tr>
32
+
33
+ </table>
34
+
35
  ## ▶️ Setup Environment
36
  Our method is tested with CUDA 12.1, accelerator fp16 precision, and xformers on a single L40 GPU.
37
 
assets/teaser/2cats.gif ADDED

Git LFS Details

  • SHA256: 22d760b8d9c1e58ab76b8c20f566bdc8bd49d0c32406aafcefd9fb66f92a3db7
  • Pointer size: 132 Bytes
  • Size of remote file: 5.57 MB
assets/teaser/2monkeys.gif ADDED

Git LFS Details

  • SHA256: fa2ac46b3dba64364be8664338462865252a2e2e7eadd4c0fe708b16ea1ea758
  • Pointer size: 132 Bytes
  • Size of remote file: 6.84 MB
assets/teaser/man-text-message.gif ADDED

Git LFS Details

  • SHA256: 420c657c88c9ba30e91cadbfb27577725e791cc1720907b12745d65d2c73f66d
  • Pointer size: 132 Bytes
  • Size of remote file: 5.02 MB
assets/teaser/multi-grain-demo.gif ADDED

Git LFS Details

  • SHA256: 7a7998a1b2e40b328d4d6837fbdf9ec2c9ee1d42b86d60b9216ec2aae0381d1e
  • Pointer size: 132 Bytes
  • Size of remote file: 6.77 MB
assets/teaser/soap-box.gif ADDED

Git LFS Details

  • SHA256: ffda15caa855e81b65ed4b25686ea290a4c7baca75ebc72e42eb03958994175d
  • Pointer size: 132 Bytes
  • Size of remote file: 4.51 MB
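
The Git LFS details above can be checked locally. The following is a minimal sketch, assuming the standard Git LFS pointer layout (a `version` line, an `oid sha256:` line, and a `size` line). The helper names `sha256_hex`, `lfs_pointer`, and `matches_listing` are hypothetical, and the byte size passed to `lfs_pointer` is a placeholder, because the listing only gives the rounded value 5.57 MB.

```python
import hashlib

# Spec URL that appears on the first line of a Git LFS pointer file.
LFS_SPEC = "https://git-lfs.github.com/spec/v1"


def sha256_hex(data: bytes) -> str:
    """Return the hex SHA256 digest of the given bytes."""
    return hashlib.sha256(data).hexdigest()


def lfs_pointer(oid: str, size: int) -> str:
    """Build pointer-file text for an object with this SHA256 and byte size."""
    return f"version {LFS_SPEC}\noid sha256:{oid}\nsize {size}\n"


def matches_listing(data: bytes, expected_oid: str) -> bool:
    """Check downloaded file contents against the SHA256 shown in the listing."""
    return sha256_hex(data) == expected_oid.lower()


# SHA256 copied from the listing for assets/teaser/2cats.gif.
oid_2cats = "22d760b8d9c1e58ab76b8c20f566bdc8bd49d0c32406aafcefd9fb66f92a3db7"

# Placeholder size (assumption): the exact byte count is not in the listing.
pointer = lfs_pointer(oid_2cats, 5_570_000)
print(len(pointer.encode("utf-8")))  # 132 for any 7-digit size
```

Under this layout, any seven-digit byte size gives a 132-byte pointer, which matches the "Pointer size: 132 Bytes" reported for each GIF. To verify a real download, read the file in binary mode and pass its contents to `matches_listing` together with the SHA256 from the listing.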