DAMO-NLP-SG/VL3-SigLIP-NaViT

Image Feature Extraction

videollama3_vision_encoder

feature-extraction

multi-modal-large-language-model

824 MB

3 contributors

History: 15 commits

seungeon-enerzai: Update image_processing_videollama3.py (6c30fd5, verified, 3 months ago)