Visual Question Answering
Transformers
English
videollama2_mixtral
text-generation
multimodal large language model
large video-language model
Instructions to use DAMO-NLP-SG/VideoLLaMA2-8x7B-Base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use DAMO-NLP-SG/VideoLLaMA2-8x7B-Base with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("visual-question-answering", model="DAMO-NLP-SG/VideoLLaMA2-8x7B-Base")
# Load model directly
from transformers import AutoModelForMultimodalLM
model = AutoModelForMultimodalLM.from_pretrained("DAMO-NLP-SG/VideoLLaMA2-8x7B-Base", dtype="auto")
- Notebooks
- Google Colab
- Kaggle
Commit History
Update config.json bab71d8 verified
Zesen Cheng committed on
Update README.md a2c9800 verified
update 72b model d99f1ce verified
Update README.md b41dd4a verified
Update README.md f1f1ca1 verified
Update README.md c992340 verified
copy 7b base 787f367 verified
Upload model weights. a8fb83c
clownrat6 committed on
initial commit 9f0b48f verified
Zesen Cheng committed on