4D-Humans (HMR 2.0): mesh from video 🚧 not trained yet
Recover and track an animated 3D SMPL body from your own image or video.
Status: documented recipe (placeholder). A production-grade pipeline from Ropedia Academy for an advanced, GPU-heavy task. Everything below (base model, objective, dataset, config, and the exact evaluation) is specified; the weights, metrics, and figures land here once you run the notebook on a GPU and publish the results. Try the trained models live in the Ropedia demos Space.
At a glance
| Base model | HMR 2.0 / 4D-Humans (ViT-based, pretrained) |
| Task | 3D human mesh recovery |
| Training objective | Regress SMPL pose+shape from image crops; track across video. |
| Track | A · Human modeling |
| Built on | shubham-goel/4D-Humans |
| Notebook | |
| Compute / storage / time | GPU required; see the Compute · storage · time table in the notebook |
Dataset
- Source: your own image or video for inference; H36M, 3DPW, COCO, and InstaVariety for training.
Training config
GPU-scale: the notebook ships a demo profile (free Colab T4) and a full profile, with an exact Compute · storage · time table. Hyperparameters (optimizer, steps, batch, LoRA rank, …) are in the training cell.
Evaluation results
⏳ Pending: run the notebook on a GPU to fill this in. This lab reports PA-MPJPE / MPJPE (mm) · PCK on a held-out split (see its Evaluate cell).
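For readers unfamiliar with these metrics, here is a minimal NumPy sketch of how they are commonly computed. It is an illustration, not the notebook's Evaluate cell: the function names (`mpjpe`, `procrustes_align`, `pa_mpjpe`, `pck`), the `(J, 3)` joint layout, and the PCK threshold argument are assumptions made for this example. MPJPE is the mean Euclidean distance between predicted and ground-truth joints. PA-MPJPE is the same error after a similarity (Procrustes) alignment of the prediction to the ground truth. PCK is taken here as the fraction of joints whose error is below a threshold.

```python
import numpy as np


def mpjpe(pred: np.ndarray, gt: np.ndarray) -> float:
    """Mean per-joint position error, in the units of the inputs."""
    return float(np.linalg.norm(pred - gt, axis=-1).mean())


def procrustes_align(pred: np.ndarray, gt: np.ndarray) -> np.ndarray:
    """Similarity-align pred (J, 3) to gt (J, 3): scale, rotation, translation."""
    mu_pred, mu_gt = pred.mean(axis=0), gt.mean(axis=0)
    p, g = pred - mu_pred, gt - mu_gt
    # SVD of the 3x3 cross-covariance yields the optimal rotation (Kabsch).
    u, s, vt = np.linalg.svd(p.T @ g)
    # Flip the last axis if needed so the result is a rotation, not a reflection.
    d = np.ones(3)
    if np.linalg.det(vt.T @ u.T) < 0:
        d[-1] = -1.0
    rot = vt.T @ np.diag(d) @ u.T
    # Optimal isotropic scale for the rotated prediction.
    scale = (s * d).sum() / (p ** 2).sum()
    return scale * p @ rot.T + mu_gt


def pa_mpjpe(pred: np.ndarray, gt: np.ndarray) -> float:
    """MPJPE after Procrustes alignment of pred to gt."""
    return mpjpe(procrustes_align(pred, gt), gt)


def pck(pred: np.ndarray, gt: np.ndarray, threshold: float) -> float:
    """Fraction of joints whose error is below threshold (same units as inputs)."""
    errors = np.linalg.norm(pred - gt, axis=-1)
    return float((errors < threshold).mean())
```

If the joints are stored in meters, multiplying the MPJPE and PA-MPJPE results by 1000 gives millimeters. Because PA-MPJPE removes global scale, rotation, and translation, it isolates errors in articulated pose, which is why it is usually reported alongside plain MPJPE.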
Inference example
No weights are published yet. After a GPU run, load the checkpoint/adapter the notebook saves (it also has a ready inference cell). Base model: HMR 2.0 / 4D-Humans (ViT-based, pretrained).
How to fill this repo
- Open the notebook in Colab → Runtime → GPU → Run all (runs the real pipeline).
- Run its Publish to the Hugging Face Hub step (or call HfApi().upload_folder(...)); the checkpoint, metrics.json, and figures then replace this placeholder.
- [ ] Train / run on a GPU · [ ] upload weights · [ ] add metrics.json · [ ] add figures · [ ] swap in the real results card
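The manual publish route can be sketched as below. This is a hedged outline, not the notebook's Publish cell: the helper names `write_metrics` and `publish`, the keys inside metrics.json, and the output folder layout are hypothetical. Only `HfApi().upload_folder` comes from the `huggingface_hub` library.

```python
import json
from pathlib import Path


def write_metrics(out_dir, pa_mpjpe_mm, mpjpe_mm, pck):
    """Write metrics.json into out_dir and return its path (hypothetical schema)."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    path = out / "metrics.json"
    payload = {"pa_mpjpe_mm": pa_mpjpe_mm, "mpjpe_mm": mpjpe_mm, "pck": pck}
    path.write_text(json.dumps(payload, indent=2))
    return path


def publish(out_dir, repo_id):
    """Upload the whole output folder (checkpoint, metrics, figures) to the Hub."""
    # Imported lazily so write_metrics works without huggingface_hub installed.
    from huggingface_hub import HfApi

    api = HfApi()  # picks up the token from a prior login or the HF_TOKEN variable
    api.upload_folder(
        folder_path=str(out_dir),
        repo_id=repo_id,  # placeholder: pass your own "<user>/<repo>" model repo
        repo_type="model",
        commit_message="Add checkpoint, metrics.json, and figures",
    )
```

After a real GPU run, pass the measured values to `write_metrics`, then call `publish` with the output folder and your own repository id. Uploading requires write access to that repository.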
Limitations
Not yet trained, so there are no numbers to report. The pipeline is GPU-heavy (see the compute table); on free Colab use the demo-scale settings. This is an educational, reproducible recipe, not a tuned production release.
License
Code: MIT (this repository). The base model (shubham-goel/4D-Humans) and dataset are each under their own licenses; check the upstream source before redistribution.
Citation
@misc{ropedia_academy,
title = {Ropedia Academy: an interactive course on embodied \& spatial AI},
author = {Ropedia Academy},
year = {2026},
howpublished = {\url{https://chaoyue0307.github.io/ropedia-academy/}}
}
Method / original work: Goel et al., Humans in 4D, ICCV 2023.
Related assets
- Live demos: https://huggingface.co/spaces/cy0307/ropedia-demos
- All models + collection: https://huggingface.co/cy0307
- Course & all labs: https://chaoyue0307.github.io/ropedia-academy/ · Labs tab
- Source / notebooks: github.com/ChaoYue0307/ropedia-academy
- Relates to tracks: C · D
Documented placeholder in the Ropedia Academy collection. Train it on a GPU to publish the real model. Contributions welcome on GitHub.