XiangpengYang commited on
Commit
7a68c8c
·
1 Parent(s): 1d7168b
README.md CHANGED
@@ -3,7 +3,7 @@
3
  <font color="red"> VideoGrain: </font></center> <br>
4
  <center> Modulating Space-Time Attention for Multi-Grained Video Editing (ICLR 2025)
5
  </h2>
6
-
7
  ## [<a href="https://knightyxp.github.io/VideoGrain_project_page/" target="_blank">Project Page</a>]
8
 
9
  [![arXiv](https://img.shields.io/badge/arXiv-2502.17258-B31B1B.svg)](https://arxiv.org/abs/2502.17258)
@@ -11,26 +11,58 @@
11
  [![Project page](https://img.shields.io/badge/Project-Page-brightgreen)](https://knightyxp.github.io/VideoGrain_project_page/)
12
 
13
 
14
- <table class="center">
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
15
  <td><img src="assets/teaser/multi-grain-demo.gif"></td>
16
  <td><img src="assets/teaser/2monkeys.gif"></td>
17
  <tr>
18
  <td width=16% style="text-align:center;">Multi-Grained Video Editing</td>
19
- <td width=16% style="text-align:center;">"Class Level: human class → spiderman"</td>
20
- <td width=16% style="text-align:center;">"Instance Level: left → Spiderman, right → Polar Bear"</td>
21
- <td width=16% style="text-align:center;">"Part Level: Polar Bear + Sunglasses"</td>
22
- <td width=20% style="text-align:center;">"left → teddy bear, right → golden retriever"</td>
23
  </tr>
24
  <td><img src="assets/teaser/2cats.gif"></td>
25
  <td><img src="assets/teaser/soap-box.gif"></td>
26
- <td><img src="assets/teaser/man"></td>
27
  <tr>
28
- <td width=25% style="text-align:center;">"left cat→ Samoyed, right cat→ Tiger"</td>
29
- <td width=25% style="text-align:center;">"behind→ Iron Man, front→ Stormtrooper"</td>
30
- <td width=25% style="text-align:center;">"half-sleeve gray shirt→ a black suit"</td>
31
  </tr>
32
 
33
- </table >
34
 
35
  ## ▶️ Setup Environment
36
  Our method is tested using cuda12.1, fp16 of accelerator and xformers on a single L40.
 
3
  <font color="red"> VideoGrain: </font></center> <br>
4
  <center> Modulating Space-Time Attention for Multi-Grained Video Editing (ICLR 2025)
5
  </h2>
6
+ </div>
7
  ## [<a href="https://knightyxp.github.io/VideoGrain_project_page/" target="_blank">Project Page</a>]
8
 
9
  [![arXiv](https://img.shields.io/badge/arXiv-2502.17258-B31B1B.svg)](https://arxiv.org/abs/2502.17258)
 
11
  [![Project page](https://img.shields.io/badge/Project-Page-brightgreen)](https://knightyxp.github.io/VideoGrain_project_page/)
12
 
13
 
14
+
15
+
16
+ <table class="center" border="1" cellspacing="0" cellpadding="5">
17
+ <tr>
18
+ <td><img src="assets/teaser/run_two_man.gif" alt="Source Video"></td>
19
+ <td><img src="assets/teaser/class_level.gif" alt="Class Level"></td>
20
+ <td><img src="assets/teaser/instance_level.gif" alt="Instance Level"></td>
21
+ <td><img src="assets/teaser/part_level.gif" alt="Part Level"></td>
22
+ <td><img src="assets/teaser/2monkeys.gif" alt="Demo2 Image 1"></td>
23
+ <td><img src="assets/teaser/2monkeys_edit.gif" alt="Demo2 Image 2"></td>
24
+ </tr>
25
+ <tr>
26
+ <td style="text-align:center;"></td>
27
+ <td style="text-align:center;">Class Level: human class → spiderman</td>
28
+ <td style="text-align:center;">Instance Level: left → Spiderman, right → Polar Bear</td>
29
+ <td style="text-align:center;">Part Level: Polar Bear + Sunglasses</td>
30
+ <td colspan="2" style="text-align:center;">left → teddy bear, right → golden retriever</td>
31
+ </tr>
32
+
33
+ <tr>
34
+ <td colspan="2"><img src="assets/teaser/2cats.gif" alt="Demo3 Image 1"></td>
35
+ <td colspan="2"><img src="assets/teaser/soap-box.gif" alt="Demo3 Image 2"></td>
36
+ <td colspan="2"><img src="assets/teaser/man-text-message.gif" alt="Demo3 Image 3"></td>
37
+ </tr>
38
+ <tr>
39
+ <td colspan="2" style="text-align:center;">left cat → Samoyed, right cat → Tiger</td>
40
+ <td colspan="2" style="text-align:center;">behind → Iron Man, front → Stormtrooper</td>
41
+ <td colspan="2" style="text-align:center;">half-sleeve gray shirt → a black suit</td>
42
+ </tr>
43
+ </table>
44
+
45
+
46
+ <!-- <table class="center">
47
  <td><img src="assets/teaser/multi-grain-demo.gif"></td>
48
  <td><img src="assets/teaser/2monkeys.gif"></td>
49
  <tr>
50
  <td width=16% style="text-align:center;">Multi-Grained Video Editing</td>
51
+ <td width=16% style="text-align:center;">Class Level: human class → spiderman</td>
52
+ <td width=16% style="text-align:center;">Instance Level: left → Spiderman, right → Polar Bear</td>
53
+ <td width=16% style="text-align:center;">Part Level: Polar Bear + Sunglasses</td>
54
+ <td width=20% style="text-align:center;">left → teddy bear, right → golden retriever</td>
55
  </tr>
56
  <td><img src="assets/teaser/2cats.gif"></td>
57
  <td><img src="assets/teaser/soap-box.gif"></td>
58
+ <td><img src="assets/teaser/man-text-message.gif"></td>
59
  <tr>
60
+ <td width=25% style="text-align:center;">left cat→ Samoyed, right cat→ Tiger</td>
61
+ <td width=25% style="text-align:center;">behind→ Iron Man, front→ Stormtrooper</td>
62
+ <td width=25% style="text-align:center;">half-sleeve gray shirt→ a black sui</td>
63
  </tr>
64
 
65
+ </table > -->
66
 
67
  ## ▶️ Setup Environment
68
  Our method is tested using cuda12.1, fp16 of accelerator and xformers on a single L40.
assets/teaser/2monkeys.gif CHANGED

Git LFS Details

  • SHA256: fa2ac46b3dba64364be8664338462865252a2e2e7eadd4c0fe708b16ea1ea758
  • Pointer size: 132 Bytes
  • Size of remote file: 6.84 MB

Git LFS Details

  • SHA256: 541be7dd2d57e0a18683b5c334da8e1c73c0fb58768346b596c7c3d2a8a812ee
  • Pointer size: 132 Bytes
  • Size of remote file: 3.84 MB
assets/teaser/2monkeys_edit.gif ADDED

Git LFS Details

  • SHA256: 42752b70f94d3910c970ded8f040655f071342f0c523810699f8f4b93b823fdf
  • Pointer size: 132 Bytes
  • Size of remote file: 3.13 MB
assets/teaser/class_level.gif ADDED

Git LFS Details

  • SHA256: 7ade31f4554dc090a4987887d2a9af7a88d643c2d165018b7c7ed4ab3d5ed984
  • Pointer size: 132 Bytes
  • Size of remote file: 2.35 MB
assets/teaser/instance_level.gif ADDED

Git LFS Details

  • SHA256: bb1b4afe4e7716876eeff060283d917491f630158ff6b10808b710152347ba98
  • Pointer size: 132 Bytes
  • Size of remote file: 2.68 MB
assets/teaser/part_level.gif ADDED

Git LFS Details

  • SHA256: a859a4ca63ee4707bb55c0b0e887726ddcf0bdaac0ddd63ae9bf7a4eda60a3c9
  • Pointer size: 132 Bytes
  • Size of remote file: 2.6 MB
assets/teaser/run_two_man.gif ADDED

Git LFS Details

  • SHA256: ed0dff7d0d5c58fcc489ef32edfff3307b08d62856e4e2646b1fdfbeb7f3205b
  • Pointer size: 132 Bytes
  • Size of remote file: 2.47 MB