mvp18
/

gscenes-checkpoints

3d-reconstruction

gaussian-splatting

Model card Files Files and versions

gscenes-checkpoints / README.md

mvp18's picture

Update README.md

201cc5f verified 3 months ago

|

history blame contribute delete

1.39 kB

	---
	license: openrail
	language:
	- en
	pipeline_tag: image-to-image
	datasets:
	- mvp18/gscenes_pretrain
	tags:
	- diffusion
	- image-to-image
	- image-to-3d
	- 3d-reconstruction
	- gaussian-splatting
	- pose-free
	- sparse-view
	- rgbd
	base_model:
	- stabilityai/stable-diffusion-2
	---

	## Summary

	This repository provides checkpoints used in the Gaussian Scenes pipeline for pose-free, sparse-view scene reconstruction. The weights are stored in Diffusers format and organized as two components:

	- UNet — denoising backbone (Diffusers UNet) adapted for our pipeline.
	- VAE — variational autoencoder used for latent encoding/decoding.

	These checkpoints are intended for research use and model reproducibility.

	## Usage

	For a guide on how to use this model, check out the [official repository](https://github.com/gaussian-scenes).

	## Citation

	If you use these checkpoints in your work, please cite the associated paper:

	```
	@article{
	paul2025gaussian,
	title={Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors},
	author={Soumava Paul and Prakhar Kaushik and Alan Yuille},
	journal={Transactions on Machine Learning Research},
	issn={2835-8856},
	year={2025},
	url={https://openreview.net/forum?id=yp1CYo6R0r},
	note={}
	}
	```

	The HuggingFace paper page can be found [here](https://huggingface.co/papers/2411.15966).