Video-Text-to-Text
Transformers
Safetensors
English
llava_llama
multimodal
video-understanding
region-grounding
3d-reasoning
4d-reasoning
perceptual-distillation
nvila
vila
Instructions to use nvidia/4D-RGPT-8B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use nvidia/4D-RGPT-8B with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("nvidia/4D-RGPT-8B", dtype="auto") - Notebooks
- Google Colab
- Kaggle