VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs Paper • 2406.07476 • Published Jun 11, 2024 • 37
DAMO-NLP-SG/VideoLLaMA2.1-7B-AV Visual Question Answering • 9B • Updated Oct 25, 2024 • 2.43k • 16
DAMO-NLP-SG/VideoLLaMA2-7B-16F Visual Question Answering • 8B • Updated Aug 13, 2024 • 117 • 14