nvidia
/

4D-RGPT-8B

+---
+license: cc-by-nc-4.0
+library_name: transformers
+pipeline_tag: video-text-to-text
+tags:
+  - multimodal
+  - video-understanding
+  - region-grounding
+  - 3d-reasoning
+  - 4d-reasoning
+  - perceptual-distillation
+  - nvila
+  - vila
+base_model: Efficient-Large-Model/NVILA-Lite-8B
+language:
+  - en
+datasets:
+  - nvidia/R4D-Bench
+---
 # Model Overview
 ### Description: