File size: 5,290 Bytes
9c1b5ea
 
daab846
9c1b5ea
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
daab846
 
 
 
 
 
 
 
 
 
9c1b5ea
 
daab846
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ce2b4b7
 
 
daab846
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9c1b5ea
ce2b4b7
 
 
9c1b5ea
daab846
9c1b5ea
daab846
9c1b5ea
daab846
9c1b5ea
daab846
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
---
tags:
- text-to-video
- lora
- diffusers
- template:diffusion-lora
widget:
- output:
    url: images/video-0001.png
  text: >-
    Stunning panoramic underwater shot of a vibrant coral reef ecosystem
    brimming with marine life. Colorful fish dart effortlessly among intricate
    coral formations, soft rays of sunlight filter through the crystal-clear
    waters, creating mesmerizing patterns on the ocean floor. Wide-angle
    capturing vivid hues and abundant biodiversity.
  parameters:
    negative_prompt: >-
      overly colorful, overexposed, static, blurred details, subtitles, style,
      artwork, painting, still image, overall grayish, worst quality, low
      quality, JPEG compression artifacts, ugly, incomplete, extra fingers,
      poorly drawn hands, poorly drawn face, deformed, disfigured, malformed
      limbs, fused fingers, static scene, cluttered background, three legs, many
      people in background, walking backwards, people at bottom of frame, person
      holding camera
base_model: Wan-AI/Wan2.1-T2V-1.3B
instance_prompt: null
license: apache-2.0
---
<a align="center">
  <video src='https://github.com/user-attachments/assets/238d45f9-9335-42be-9b03-27ed6880ce29'></video>
</a>

<p align="center">
   <a href='https://panowan.variantconst.com'><img src='https://img.shields.io/badge/Project-Page-Green'></a> &nbsp;
   <a href='https://arxiv.org/abs/2505.22016'><img src='https://img.shields.io/badge/arXiv-2505.22016-b31b1b.svg'></a> &nbsp;
   <a href='https://huggingface.co/datasets/yousiki/PanoWan'><img src='https://img.shields.io/badge/Dataset-PanoWan-yellow'></a>
</p>

# PanoWan

Official repository for "PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms"

## Text-to-360° Video Generation

Generate panoramic videos from text prompts:

<table>
  <tr>
    <td align="center" valign="top">
      <video src="https://github.com/user-attachments/assets/6453d67b-60ad-42be-9a72-013f1449341a" width="200"></video>
    </td>
    <td align="center" valign="top">
      <video src="https://github.com/user-attachments/assets/a494213c-ca3f-4ff1-93d2-a36df8ad8790" width="200"></video>
    </td>
    <td align="center" valign="top">
      <video src="https://github.com/user-attachments/assets/4614969f-9973-421a-9fc0-4f4222e2b84b" width="200"></video>
    </td>
    <td align="center" valign="top">
      <video src="https://github.com/user-attachments/assets/746e5c19-18ef-4fab-b25b-68521f0dcea9" width="200"></video>
    </td>
  </tr>
  <tr>
    <td align="center" valign="top">
      <video src="https://github.com/user-attachments/assets/92b0c1d9-5005-4cf4-adf2-5042e7bf0abf" width="200"></video>
    </td>
    <td align="center" valign="top">
      <video src="https://github.com/user-attachments/assets/7c040660-4a9e-408f-9490-79fb5f349297" width="200"></video>
    </td>
    <td align="center" valign="top">
      <video src="https://github.com/user-attachments/assets/add58417-e00c-4fae-af6f-2cbfb0b8019b" width="200"></video>
    </td>
    <td align="center" valign="top">
      <video src="https://github.com/user-attachments/assets/96c64c3d-a3f3-4832-94e2-c3c1c6a3edf0" width="200"></video>
    </td>
  </tr>
</table>

## Zero-Shot Applications

### Long Video Generation

Generate extended panoramic videos using temporal windowing and seamless blending:

<a align="center">
  <video src='https://github.com/user-attachments/assets/1e404ed9-0165-4a67-89e3-9f9ac9ecd052'></video>
</a>

### Super Resolution

Enhance low-resolution panoramic videos to 2x resolution:

<div align="center">
  <table>
    <tr>
      <td width="50%">
        <video src="https://github.com/user-attachments/assets/ccb31e1f-133b-4fbb-86e2-084cf2edce28" width="100%"></video>
        <p align="center">Low Resolution</p>
      </td>
      <td width="50%">
        <video src="https://github.com/user-attachments/assets/d06ad051-9d25-4589-89aa-f15b85894119" width="100%"></video>
        <p align="center">High Resolution</p>
      </td>
    </tr>
  </table>
</div>

### Semantic Editing

Edit panoramic videos with text-guided modifications:

<div align="center">
  <table>
    <tr>
      <td width="50%">
        <video src="https://github.com/user-attachments/assets/951cd6ff-60df-4480-a728-9a3d38568117" width="100%"></video>
        <p align="center">Original</p>
      </td>
      <td width="50%">
        <video src="https://github.com/user-attachments/assets/c0e12443-a479-40eb-aa67-d4874b745fde" width="100%"></video>
        <p align="center">Edited</p>
      </td>
    </tr>
  </table>
</div>

### Video Outpainting

Transform conventional videos to panoramic format:

<a align="center">
  <video src='https://github.com/user-attachments/assets/92dca130-d5c0-423e-a285-9f6402b8db9d'></video>
</a>

## Dataset

The metadata for our dataset is released at [HuggingFace](https://huggingface.co/datasets/yousiki/PanoWan).

## Citation

```bibtex
@article{xia2025panowan,
  title={PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms},
  author={Xia, Yifei and Weng, Shuchen and Yang, Siqi and Liu, Jingqi and Zhu, Chengxuan and Teng, Minggui and Jia, Zijian and Jiang, Han and Shi, Boxin},
  journal={arXiv preprint arXiv:2505.22016},
  year={2025}
}
```