VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs Paper β’ 2406.07476 β’ Published Jun 11, 2024 β’ 37
DAMO-NLP-SG/VideoLLaMA2.1-7B-AV Visual Question Answering β’ 9B β’ Updated Oct 25, 2024 β’ 2.43k β’ 16
DAMO-NLP-SG/VideoLLaMA2-7B-16F Visual Question Answering β’ 8B β’ Updated Aug 13, 2024 β’ 123 β’ 14
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base Visual Question Answering β’ Updated Oct 21, 2024 β’ 10 β’ 1