KlingTeam/StereoPilot

Add pipeline tag, library name, and explicit links to model card

by nielsr (HF Staff), opened Dec 19, 2025

Files changed (1)

README.md

@@ -1,6 +1,9 @@
 ---
 license: mit
 ---
 # StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors
 <!-- <div align="center" style="margin-top: 0px; margin-bottom: 0px;">
@@ -17,6 +20,8 @@ _**[Guibao Shen](https://a-bigbao.github.io)<sup>1,3*†</sup>, [Yihua Du](https
 </div>
 ## 📖 Introduction
 **TL;DR:** We propose **StereoPilot**, an efficient feed-forward architecture that leverages pretrained video diffusion transformers to directly synthesize novel views, overcoming the limitations of *Depth-Warp-Inpaint* methods without iterative denoising. With a domain switcher and cycle consistency loss, it enables robust multi-format stereo conversion. We also introduce **UniStereo**, the first large-scale unified dataset featuring both parallel and converged stereo formats.

 ---
 license: mit
+pipeline_tag: image-to-video
+library_name: diffusers
 ---
 # StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors
 <!-- <div align="center" style="margin-top: 0px; margin-bottom: 0px;">
 </div>
+### [[Project Page]](https://hit-perfect.github.io/StereoPilot/) [[arXiv]](https://arxiv.org/abs/2512.16915) [[Code]](https://github.com/KlingTeam/StereoPilot) [Dataset]
 ## 📖 Introduction
 **TL;DR:** We propose **StereoPilot**, an efficient feed-forward architecture that leverages pretrained video diffusion transformers to directly synthesize novel views, overcoming the limitations of *Depth-Warp-Inpaint* methods without iterative denoising. With a domain switcher and cycle consistency loss, it enables robust multi-format stereo conversion. We also introduce **UniStereo**, the first large-scale unified dataset featuring both parallel and converged stereo formats.
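
For anyone reviewing this change, here is a minimal sketch of how the front matter after this PR could be sanity-checked locally. It is not the Hub's own parser: it only handles flat `key: value` lines between the `---` delimiters, which is all this README's metadata uses, and the names `parse_front_matter` and `README_AFTER` are hypothetical, introduced only for illustration.

```python
def parse_front_matter(readme_text):
    """Return flat key/value pairs from a leading '---' delimited block.

    Assumption: the metadata is a flat list of `key: value` lines, as in
    this README. This is not a general YAML parser.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter at the top of the file
    metadata = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return metadata  # closing delimiter reached
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    return {}  # opening delimiter without a closing one


# Front matter as it reads after this PR (copied from the diff above).
README_AFTER = """---
license: mit
pipeline_tag: image-to-video
library_name: diffusers
---
# StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors
"""

metadata = parse_front_matter(README_AFTER)
print(metadata)
```

A check like this only confirms that the three keys are present and spelled as intended; whether `image-to-video` and `diffusers` are the right values for this model is still a judgment call for the maintainers.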