Tags: Visual Question Answering · Transformers · Safetensors · English · videollama2_qwen2 · text-generation · Audio-visual Question Answering · Audio Question Answering · multimodal large language model
Instructions to use DAMO-NLP-SG/VideoLLaMA2.1-7B-AV with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use DAMO-NLP-SG/VideoLLaMA2.1-7B-AV with Transformers:

```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("visual-question-answering", model="DAMO-NLP-SG/VideoLLaMA2.1-7B-AV")
```

```python
# Load model directly
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("DAMO-NLP-SG/VideoLLaMA2.1-7B-AV", dtype="auto")
```

- Notebooks
- Google Colab
- Kaggle
Community discussions:

- #4 · Some weights of Videollama2Qwen2ForCausalLM were not initialized from the model checkpoint at ./VideoLLaMA2.1-7B-AV and are newly initialized · 10 · opened over 1 year ago by deleted
- #3 · Does this model support 'image' inference? · opened over 1 year ago by thesby
- #2 · Do the paths in config.json need to be modified when running the AV branch? · opened over 1 year ago by FoerKent
- #1 · process_video() got an unexpected keyword argument 'va' · 5 · opened over 1 year ago by fragrantly