DAMO-NLP-SG/VL3-SigLIP-NaViT

Image Feature Extraction

videollama3_vision_encoder

feature-extraction

multi-modal-large-language-model

Training details

#2

by lucasjin - opened Jan 23, 2025

Are there any details on how this model was trained?
