---
tags:
- text-to-video
- lora
- diffusers
- template:diffusion-lora
widget:
- output:
    url: images/video-0001.png
  text: >-
    Stunning panoramic underwater shot of a vibrant coral reef ecosystem
    brimming with marine life. Colorful fish dart effortlessly among intricate
    coral formations, soft rays of sunlight filter through the crystal-clear
    waters, creating mesmerizing patterns on the ocean floor. Wide-angle
    capturing vivid hues and abundant biodiversity.
  parameters:
    negative_prompt: >-
      overly colorful, overexposed, static, blurred details, subtitles, style,
      artwork, painting, still image, overall grayish, worst quality, low
      quality, JPEG compression artifacts, ugly, incomplete, extra fingers,
      poorly drawn hands, poorly drawn face, deformed, disfigured, malformed
      limbs, fused fingers, static scene, cluttered background, three legs, many
      people in background, walking backwards, people at bottom of frame, person
      holding camera
base_model: Wan-AI/Wan2.1-T2V-1.3B
instance_prompt: null
license: apache-2.0
---
<a align="center">
<video src='https://github.com/user-attachments/assets/238d45f9-9335-42be-9b03-27ed6880ce29'></video>
</a>
<p align="center">
<a href='https://panowan.variantconst.com'><img src='https://img.shields.io/badge/Project-Page-Green'></a>
<a href='https://arxiv.org/abs/2505.22016'><img src='https://img.shields.io/badge/arXiv-2505.22016-b31b1b.svg'></a>
<a href='https://huggingface.co/datasets/yousiki/PanoWan'><img src='https://img.shields.io/badge/Dataset-PanoWan-yellow'></a>
</p>
# PanoWan
Official repository for "PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms".
## Text-to-360° Video Generation
Generate panoramic videos from text prompts:
<table>
<tr>
<td align="center" valign="top">
<video src="https://github.com/user-attachments/assets/6453d67b-60ad-42be-9a72-013f1449341a" width="200"></video>
</td>
<td align="center" valign="top">
<video src="https://github.com/user-attachments/assets/a494213c-ca3f-4ff1-93d2-a36df8ad8790" width="200"></video>
</td>
<td align="center" valign="top">
<video src="https://github.com/user-attachments/assets/4614969f-9973-421a-9fc0-4f4222e2b84b" width="200"></video>
</td>
<td align="center" valign="top">
<video src="https://github.com/user-attachments/assets/746e5c19-18ef-4fab-b25b-68521f0dcea9" width="200"></video>
</td>
</tr>
<tr>
<td align="center" valign="top">
<video src="https://github.com/user-attachments/assets/92b0c1d9-5005-4cf4-adf2-5042e7bf0abf" width="200"></video>
</td>
<td align="center" valign="top">
<video src="https://github.com/user-attachments/assets/7c040660-4a9e-408f-9490-79fb5f349297" width="200"></video>
</td>
<td align="center" valign="top">
<video src="https://github.com/user-attachments/assets/add58417-e00c-4fae-af6f-2cbfb0b8019b" width="200"></video>
</td>
<td align="center" valign="top">
<video src="https://github.com/user-attachments/assets/96c64c3d-a3f3-4832-94e2-c3c1c6a3edf0" width="200"></video>
</td>
</tr>
</table>
## Zero-Shot Applications
### Long Video Generation
Generate extended panoramic videos using temporal windowing and seamless blending:
<a align="center">
<video src='https://github.com/user-attachments/assets/1e404ed9-0165-4a67-89e3-9f9ac9ecd052'></video>
</a>
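The idea of temporal windowing with blending can be sketched as follows. This is a generic illustration, not the PanoWan implementation: the function names `window_starts` and `blend_windows`, the triangular weighting, and the representation of a frame as a flat list of floats are all assumptions made for the example. In a real pipeline the blending would more likely operate on latent tensors during denoising.

```python
def window_starts(total, window, overlap):
    """Start indices of overlapping temporal windows covering `total` frames.

    Hypothetical helper: consecutive windows share `overlap` frames, and the
    last window is aligned to the end of the sequence.
    """
    if not 0 <= overlap < window:
        raise ValueError("overlap must satisfy 0 <= overlap < window")
    if total <= window:
        return [0]
    stride = window - overlap
    starts = list(range(0, total - window, stride))
    starts.append(total - window)
    return starts


def blend_windows(windows, starts, total):
    """Merge per-window frames into one sequence by weighted averaging.

    windows[k][i] is frame i of window k, given as a list of floats.
    Each frame gets a triangular weight (low at window edges, high in the
    middle), so overlapping windows cross-fade instead of switching abruptly.
    """
    window = len(windows[0])
    dim = len(windows[0][0])
    acc = [[0.0] * dim for _ in range(total)]
    norm = [0.0] * total
    for frames, start in zip(windows, starts):
        for i, frame in enumerate(frames):
            weight = float(min(i + 1, window - i))
            t = start + i
            norm[t] += weight
            for d, value in enumerate(frame):
                acc[t][d] += weight * value
    # Normalize so each output frame is a convex combination of its inputs.
    return [[v / norm[t] for v in acc[t]] for t in range(total)]


if __name__ == "__main__":
    starts = window_starts(total=6, window=4, overlap=2)
    # Two toy windows: the first is all zeros, the second all ones.
    toy = [[[0.0]] * 4, [[1.0]] * 4]
    for t, frame in enumerate(blend_windows(toy, starts, total=6)):
        print(t, round(frame[0], 3))
```

Because the weights are normalized per frame, frames covered by a single window pass through unchanged, while frames in the overlap move gradually from one window's content to the next.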
### Super Resolution
Upscale low-resolution panoramic videos to 2× their original resolution:
<div align="center">
<table>
<tr>
<td width="50%">
<video src="https://github.com/user-attachments/assets/ccb31e1f-133b-4fbb-86e2-084cf2edce28" width="100%"></video>
<p align="center">Low Resolution</p>
</td>
<td width="50%">
<video src="https://github.com/user-attachments/assets/d06ad051-9d25-4589-89aa-f15b85894119" width="100%"></video>
<p align="center">High Resolution</p>
</td>
</tr>
</table>
</div>
### Semantic Editing
Edit panoramic videos with text-guided modifications:
<div align="center">
<table>
<tr>
<td width="50%">
<video src="https://github.com/user-attachments/assets/951cd6ff-60df-4480-a728-9a3d38568117" width="100%"></video>
<p align="center">Original</p>
</td>
<td width="50%">
<video src="https://github.com/user-attachments/assets/c0e12443-a479-40eb-aa67-d4874b745fde" width="100%"></video>
<p align="center">Edited</p>
</td>
</tr>
</table>
</div>
### Video Outpainting
Transform conventional videos into panoramic format:
<a align="center">
<video src='https://github.com/user-attachments/assets/92dca130-d5c0-423e-a285-9f6402b8db9d'></video>
</a>
## Dataset
The metadata for our dataset is released on [Hugging Face](https://huggingface.co/datasets/yousiki/PanoWan).
## Citation
```bibtex
@article{xia2025panowan,
title={PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms},
author={Xia, Yifei and Weng, Shuchen and Yang, Siqi and Liu, Jingqi and Zhu, Chengxuan and Teng, Minggui and Jia, Zijian and Jiang, Han and Shi, Boxin},
journal={arXiv preprint arXiv:2505.22016},
year={2025}
}
```