Visual Question Answering
Transformers
Safetensors
English
videollama3_qwen2
text-generation
multi-modal
large-language-model
video-language-model
custom_code
Instructions to use DAMO-NLP-SG/VideoLLaMA3-7B-Image with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use DAMO-NLP-SG/VideoLLaMA3-7B-Image with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("visual-question-answering", model="DAMO-NLP-SG/VideoLLaMA3-7B-Image", trust_remote_code=True)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("DAMO-NLP-SG/VideoLLaMA3-7B-Image", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Commit History
Update image_processing_videollama3.py fe1866b verified
Update processing_videollama3.py 282c2a2 verified
Update modeling_videollama3.py 672428a verified
Update processing_videollama3.py 2701e26 verified
Update configuration_videollama3_encoder.py 0f3981b verified
Update README.md fac6fc5 verified
Update processing_videollama3.py c636d6d verified
Update processing_videollama3.py a6b1df0 verified
Update README.md 5c784ee verified
Update README.md 972bf89 verified
Update image_processing_videollama3.py 54e25e0 verified
Update processing_videollama3.py 72c23b0 verified
Update README.md b5afc99 verified
Update README.md 193cdd9 verified
Update README.md 7059329 verified
Update README.md 68b86c8 verified
Upload Videollama3Qwen2ForCausalLM 5a918de verified
Zesen Cheng commited on
Upload processor 668d25a verified
Zesen Cheng commited on
initial commit 7bf6da6 verified
Zesen Cheng commited on