Enhance model card: Add pipeline tag, links, visuals, and usage

This PR enhances the model card for DiffusionGS by:
- Adding the `pipeline_tag: image-to-3d` for improved discoverability on the Hugging Face Hub.
- Including direct links to the paper, project page, and GitHub repository.
- Incorporating key visuals (GIFs and pipeline image) from the associated Hugging Face dataset to showcase model capabilities.
- Adding a 'Quick Demo' section with a code snippet from the GitHub README to facilitate easy initial usage.
- Adding the academic citation for the paper.

Files changed (1) hide show

README.md +46 -3

README.md CHANGED Viewed

@@ -1,3 +1,46 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+pipeline_tag: image-to-3d
+---
+# Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction
+This repository contains **DiffusionGS**, a novel single-stage 3D diffusion model for object generation and scene reconstruction from a single view. As presented in the paper, DiffusionGS directly outputs 3D Gaussian point clouds at each timestep to enforce view consistency and allows the model to generate robustly given prompt views of any directions, beyond object-centric inputs. It also features a scene-object mixed training strategy to improve capability and generality. Our method enjoys over 5× faster speed (~6s on an A100 GPU) compared to state-of-the-art methods.
+*   **Paper**: [Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction](https://huggingface.co/papers/2411.14384)
+*   **Project Page**: [https://caiyuanhao1998.github.io/project/DiffusionGS/](https://caiyuanhao1998.github.io/project/DiffusionGS/)
+*   **Code**: [https://github.com/caiyuanhao1998/Open-DiffusionGS](https://github.com/caiyuanhao1998/Open-DiffusionGS)
+<div align="center">
+  <img src="https://huggingface.co/datasets/CaiYuanhao/DiffusionGS/resolve/main/img/abo.gif" width="24%" alt="abo">
+  <img src="https://huggingface.co/datasets/CaiYuanhao/DiffusionGS/resolve/main/img/gso.gif" width="24%" alt="gso">
+  <img src="https://huggingface.co/datasets/CaiYuanhao/DiffusionGS/resolve/main/img/real_img.gif" width="24%" alt="real_img">
+  <img src="https://huggingface.co/datasets/CaiYuanhao/DiffusionGS/resolve/main/img/wild.gif" width="24%" alt="wild">
+</div>
+<div align="center">
+  <img src="https://huggingface.co/datasets/CaiYuanhao/DiffusionGS/resolve/main/img/pipeline.png" width="80%" alt="DiffusionGS Pipeline">
+</div>
+## Quick Demo
+For object-centric image-to-3D generation, a single-line script is provided to use the code:
+```shell
+python run.py
+```
+This code will automatically download the model checkpoints and config files from Hugging Face.
+## Citation
+If you find our work useful, please consider citing our paper:
+```bibtex
+@inproceedings{diffusiongs,
+  title={Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction},
+  author={Yuanhao Cai and He Zhang and Kai Zhang and Yixun Liang and Mengwei Ren and Fujun Luan and Qing Liu and Soo Ye Kim and Jianming Zhang and Zhifei Zhang and Yuqian Zhou and Yulun Zhang and Xiaokang Yang and Zhe Lin and Alan Yuille},
+  booktitle={ICCV},
+  year={2025}
+}
+```