nvidia/asset-harvester

kangxuey committed on Mar 23

Commit c876909 · verified · 1 Parent(s): 5baecea

Update README.md

Files changed (1)

README.md +5 -3


@@ -20,12 +20,14 @@ pipeline_tag: image-to-3d
 ## **Description:**
-**Asset Harvester** generates 3D assets from a single image or multiple images of vehicles or VRUs.
+**Asset Harvester** generates 3D assets from a single image or multiple images of vehicles or VRUs extracted from autonomous driving sessions.
 It leverages 4 models (see the white paper for architecture) in the process.
 The [AV object Mask2former](model_cards/AV_Object_Mask2former.md) instance segmentation model is used for image processing when parsing input views from NCore data sessions.
 The input images are encoded by [C-Radio](https://huggingface.co/nvidia/C-RADIO),
-and the multiview diffusion model, [SparseViewDiT](model_cards/MultiviewDiffusion.md), is then used to generate 16 multiview images of the input objects,
-and lastly an [Object TokenGS](model_cards/Object_TokenGS.md) lifts the images to a 3D asset.
+and the multiview diffusion model, [SparseViewDiT](model_cards/MultiviewDiffusion.md), is then used to generate 16 multiview images of the input objects.
+In cases where camera parameters are not provided, the multiview diffusion model includes a camera pose estimation submodule that predicts camera parameters for the input images.
+Lastly, an [Object TokenGS](model_cards/Object_TokenGS.md) lifts the images to a 3D asset.
 This system is ready for commercial/non-commercial use
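
The data flow described in the updated README can be sketched roughly as follows. This is a minimal illustration of the stage ordering only, not the project's code: every function and class name here (`View`, `segment`, `encode`, `estimate_cameras`, `generate_multiview`, `lift_to_3d`, `harvest`) is hypothetical, and each stage is a stub that passes placeholder strings instead of running a real model.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class View:
    """One input image of the object (hypothetical container)."""
    image: str                    # placeholder for pixel data
    camera: Optional[str] = None  # None when no camera parameters are provided


def segment(views: List[View]) -> List[View]:
    # Stub standing in for the AV object Mask2former instance segmentation step.
    return views


def encode(views: List[View]) -> List[str]:
    # Stub standing in for the C-RADIO image encoder.
    return [f"feat({v.image})" for v in views]


def estimate_cameras(views: List[View]) -> List[View]:
    # Stub for the camera pose estimation submodule: fill in only missing cameras.
    return [
        View(v.image, v.camera if v.camera is not None else f"estimated({v.image})")
        for v in views
    ]


def generate_multiview(features: List[str], views: List[View],
                       num_views: int = 16) -> List[str]:
    # Stub for the multiview diffusion model (SparseViewDiT in the README).
    # Per the README, pose estimation is part of this model and is used
    # when camera parameters are not provided.
    if any(v.camera is None for v in views):
        views = estimate_cameras(views)
    return [f"novel_view_{i}" for i in range(num_views)]


def lift_to_3d(images: List[str]) -> Dict[str, object]:
    # Stub for Object TokenGS, which lifts the generated images to a 3D asset.
    return {"type": "3d_asset", "num_source_views": len(images)}


def harvest(views: List[View]) -> Dict[str, object]:
    # Stage order as described: segment, encode, multiview diffusion, 3D lifting.
    views = segment(views)
    features = encode(views)
    multiview = generate_multiview(features, views)
    return lift_to_3d(multiview)
```

Calling `harvest([View("crop_0.png")])` with no camera parameters exercises the pose estimation branch before the 16 placeholder views are produced.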