Video-Text-to-Text
Transformers
Safetensors
English
llava
text-generation
multimodal
vision-language
video understanding
spatial reasoning
visuospatial cognition
qwen
llava-video
Instructions to use nkkbr/ViCA-ScanNetPP with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use nkkbr/ViCA-ScanNetPP with Transformers:
# Load model directly
from transformers import AutoProcessor, AutoModelForCausalLM

processor = AutoProcessor.from_pretrained("nkkbr/ViCA-ScanNetPP")
model = AutoModelForCausalLM.from_pretrained("nkkbr/ViCA-ScanNetPP")

- Notebooks
- Google Colab
- Kaggle