File size: 6,872 Bytes

---
license: apache-2.0
language:
- en
pipeline_tag: image-to-3d
modalities:
- image
- point clouds
- mesh
arxiv: 2411.14384
---

# [ICCV 2025] DiffusionGS: Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction

## Model Description

These three models are trained for image-to-3D generation on object- and scene-level with the spatial resolution of 256x256 and 512x512. For object-level 3D generation,
mesh exportation is also supported. Here are some generated examples:

· (a) Object-level Generation

<p align="center">
<table border="0" cellspacing="0" cellpadding="0" style="border-collapse:collapse;border:0;">

  <!-- ===== Row 1 ===== -->
  <tr>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/abo.gif" width="210">
    </td>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/gso.gif" width="210">
    </td>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/real_img.gif" width="210">
    </td>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/wild.gif" width="210">
    </td>
  </tr>

  <!-- ===== Row 2 ===== -->
  <tr>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/sd_2.gif" width="210">
    </td>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/sd_1.gif" width="210">
    </td>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/flux_1.gif" width="210">
    </td>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/green_man.gif" width="210">
    </td>
  </tr>

</table>
</p>

· (b) Mesh Exportation

<img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/mesh.png" width="880">

· (c) Scene-level Generation

<p align="center">
<table border="0" cellspacing="0" cellpadding="0" style="border-collapse:collapse;border:0;">

  <!-- ===== Row 1 ===== -->
  <tr>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/plaza.gif" width="430">
    </td>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/town.gif" width="410">
    </td>
  </tr>

  <!-- ===== Row 2 ===== -->
  <tr>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/cliff.gif" width="425">
    </td>
    <td style="border:0;padding:10px;">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/art_gallery.gif" width="415">
    </td>
  </tr>

</table>
</p>


· (d) Comparison with Hunyuan3D-v2.5

The first row is the prompt image. The second row is Hunyuan3D-v2.5. The third row is our DiffusionGS.

Our method generates better results while enjoying 7.5x faster inference speed.



<p align="center">
<table border="0" cellspacing="0" cellpadding="0" style="border-collapse:collapse;border:0;">

  <!-- ===== Row 1: Prompt Image ===== -->
  <tr>
    <td style="border:0;padding:10px;" align="center">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/1.png" width="300">
    </td>
    <td style="border:0;padding:10px;" align="center">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/2.jpg" width="300">
    </td>
    <td style="border:0;padding:10px;" align="center">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/3.png" width="300">
    </td>
  </tr>
  <tr>
    <td colspan="3" align="center" style="padding-top:0px;font-style:italic;font-size:24px;">
      Prompt Images at Any Viewpoints  
    </td>
  </tr>

  <!-- ===== Row 2: Hunyuan3D ===== -->
  <tr>
    <td style="border:0;padding:10px;" align="center">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/hunyuan_1.gif" width="300">
    </td>
    <td style="border:0;padding:10px;" align="center">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/hunyuan_2.gif" width="300">
    </td>
    <td style="border:0;padding:10px;" align="center">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/hunyuan_3.gif" width="300">
    </td>
  </tr>
  <tr>
    <td colspan="3" align="center" style="padding-top:0px;font-style:italic;font-size:24px;">
      Tencent Hunyuan3D-v2.5 (Inference Time: 180 seconds)
    </td>
  </tr>

  <!-- ===== Row 3: DiffusionGS ===== -->
  <tr>
    <td style="border:0;padding:10px;" align="center">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/ours_1.gif" width="300">
    </td>
    <td style="border:0;padding:10px;" align="center">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/ours_2.gif" width="300">
    </td>
    <td style="border:0;padding:10px;" align="center">
      <img src="https://raw.githubusercontent.com/caiyuanhao1998/Open-DiffusionGS/master/img/ours_3.gif" width="300">
    </td>
  </tr>
  <tr>
    <td colspan="3" align="center" style="padding-top:0px;font-style:italic;font-size:24px;">
    Our DiffusionGS (Inference Time: 24 seconds)
    </td>
  </tr>

</table>
</p>



## Github Code Link

Please refer to our GitHub repo for more detailed instructions on using our code and models.

https://github.com/caiyuanhao1998/Open-DiffusionGS/


## Project Page Link

For more video and interactive generation results, please refer to our project page:

https://caiyuanhao1998.github.io/project/DiffusionGS/


## Arxiv Paper Link

For more technical details, please refer to our ICCV 2025 paper:

https://arxiv.org/abs/2411.14384


## Citation

If you find our code, data, and models useful, please consider citing our paper:

```sh
@inproceedings{diffusiongs,
  title={Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction},
  author={Yuanhao Cai and He Zhang and Kai Zhang and Yixun Liang and Mengwei Ren and Fujun Luan and Qing Liu and Soo Ye Kim and Jianming Zhang and Zhifei Zhang and Yuqian Zhou and Yulun Zhang and Xiaokang Yang and Zhe Lin and Alan Yuille},
  booktitle={ICCV},
  year={2025}
}
```