nielsr HF Staff commited on
Commit
53c2ba9
·
verified ·
1 Parent(s): 24e0558

Improve model card: Add paper link, abstract, usage, and citation; refine metadata

Browse files

This PR significantly enhances the model card for `SS4D: Native 4D Generative Model via Structured Spacetime Latents` by:

- **Updating the title** to include the full paper name for better clarity.
- **Removing `library_name: ss4d`** from the metadata, as it refers to a custom implementation and does not correspond to a standard Hugging Face library intended for automated code snippets on the Hub, as per guidelines.
- **Adding a direct link to the paper** on the Hugging Face Hub (`https://huggingface.co/papers/2512.14284`).
- **Including the paper's abstract** for a concise overview of the model.
- **Embedding the teaser image** from the GitHub repository for visual context.
- **Adding detailed installation instructions** from the GitHub README.
- **Including a sample usage snippet** (`python test.py`) as found in the GitHub README.
- **Adding the academic citation** for the work.
- **Maintaining the existing project page and code links**, along with `language: en`, `license: mit`, and `pipeline_tag: image-to-3d`.

These changes aim to make the model card more comprehensive and user-friendly, providing essential information directly on the Hub.

Files changed (1) hide show
  1. README.md +52 -7
README.md CHANGED
@@ -1,14 +1,59 @@
1
  ---
2
- library_name: ss4d
3
- pipeline_tag: image-to-3d
4
- license: mit
5
  language:
6
  - en
 
 
7
  ---
8
- # SS4D
9
 
10
- Official model weights for *SS4D: Native 4D Generative Model via Structured Spacetime Latents*
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
 
12
- Project page: https://lizb6626.github.io/SS4D/
13
 
14
- Code: https://github.com/Lizb6626/SS4D/
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
 
 
 
2
  language:
3
  - en
4
+ license: mit
5
+ pipeline_tag: image-to-3d
6
  ---
 
7
 
8
+ # SS4D: Native 4D Generative Model via Structured Spacetime Latents
9
+
10
+ Official model weights for the paper [SS4D: Native 4D Generative Model via Structured Spacetime Latents](https://huggingface.co/papers/2512.14284).
11
+
12
+ [Paper](https://huggingface.co/papers/2512.14284) | [Project Page](https://lizb6626.github.io/SS4D/) | [Code](https://github.com/Lizb6626/SS4D/)
13
+
14
+ <p align="center"><img src="https://github.com/Lizb6626/SS4D/raw/main/assets/teaser.jpg" width="100%"></p>
15
+
16
+ We present SS4D, a native 4D generative model that synthesizes dynamic 3D objects directly from monocular video. Unlike prior approaches that construct 4D representations by optimizing over 3D or video generative models, we train a generator directly on 4D data, achieving high fidelity, temporal coherence, and structural consistency. At the core of our method is a compressed set of structured spacetime latents. Specifically, (1) To address the scarcity of 4D training data, we build on a pre-trained single-image-to-3D model, preserving strong spatial consistency. (2) Temporal consistency is enforced by introducing dedicated temporal layers that reason across frames. (3) To support efficient training and inference over long video sequences, we compress the latent sequence along the temporal axis using factorized 4D convolutions and temporal downsampling blocks. In addition, we employ a carefully designed training strategy to enhance robustness against occlusion and motion blur, leading to high-quality generation. Extensive experiments show that SS4D produces spatio-temporally consistent 4D objects with superior quality and efficiency, significantly outperforming state-of-the-art methods on both synthetic and real-world datasets.
17
+
18
+ ## Installation
19
+
20
+ To set up the environment, follow the instructions from the GitHub repository:
21
+
22
+ ```sh
23
+ git clone --recurse-submodules git@github.com:Lizb6626/SS4D.git
24
+ cd SS4D
25
+ ./setup.sh --new-env --basic --xformers --flash-attn --diffoctreerast --spconv --mipgaussian --kaolin --nvdiffrast
26
+ ```
27
+
28
+ Alternatively, you can follow the detailed installation guidance provided by [TRELLIS](https://github.com/microsoft/TRELLIS/tree/main?tab=readme-ov-file#-installation).
29
+
30
+ ## Usage
31
+
32
+ We provide a minimal example to run our method on the provided examples:
33
+
34
+ ```
35
+ python test.py
36
+ ```
37
 
38
+ ## Citation
39
 
40
+ ```bibtex
41
+ @article{li2025ss4d,
42
+ author = {Li, Zhibing and Zhang, Mengchen and Wu, Tong and Tan, Jing and Wang, Jiaqi and Lin, Dahua},
43
+ title = {SS4D: Native 4D Generative Model via Structured Spacetime Latents},
44
+ year = {2025},
45
+ issue_date = {December 2025},
46
+ publisher = {Association for Computing Machinery},
47
+ address = {New York, NY, USA},
48
+ volume = {44},
49
+ number = {6},
50
+ issn = {0730-0301},
51
+ url = {https://doi.org/10.1145/3763302},
52
+ doi = {10.1145/3763302},
53
+ journal = {ACM Trans. Graph.},
54
+ month = dec,
55
+ articleno = {244},
56
+ numpages = {12},
57
+ keywords = {4D generation, 3D generation, animation, generative model}
58
+ }
59
+ ```