kangxuey commited on
Commit
c876909
·
verified ·
1 Parent(s): 5baecea

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -3
README.md CHANGED
@@ -20,12 +20,14 @@ pipeline_tag: image-to-3d
20
 
21
  ## **Description:**
22
 
23
- **Asset Harvester** generates 3D assets from a single image or multiple images of vehicles or VRUs.
 
24
  It leverages 4 models (see the white paper for architecture) in the process.
25
  The [AV object Mask2former](model_cards/AV_Object_Mask2former.md) instance segmentation model is used for image processing when parsing input views from NCore data sessions.
26
  The input images are encoded by [C-Radio](https://huggingface.co/nvidia/C-RADIO),
27
- and the multiview diffusion model, [SparseViewDiT](model_cards/MultiviewDiffusion.md), is then used to generate 16 multiview images of the input objects,
28
- and lastly an [Object TokenGS](model_cards/Object_TokenGS.md) lifts the images to a 3D asset.
 
29
 
30
  This system is ready for commercial/non-commercial use
31
 
 
20
 
21
  ## **Description:**
22
 
23
+ **Asset Harvester** generates 3D assets from a single image or multiple images of vehicles or VRUs extracted from autonomous driving sessions.
24
+
25
  It leverages 4 models (see the white paper for architecture) in the process.
26
  The [AV object Mask2former](model_cards/AV_Object_Mask2former.md) instance segmentation model is used for image processing when parsing input views from NCore data sessions.
27
  The input images are encoded by [C-Radio](https://huggingface.co/nvidia/C-RADIO),
28
+ and the multiview diffusion model, [SparseViewDiT](model_cards/MultiviewDiffusion.md), is then used to generate 16 multiview images of the input objects.
29
+ In cases where camera parameters are not provided, the multiview diffusion model includes a camera pose estimation submodule that predicts camera parameters for the input images.
30
+ Lastly, an [Object TokenGS](model_cards/Object_TokenGS.md) lifts the images to a 3D asset.
31
 
32
  This system is ready for commercial/non-commercial use
33