DAMO-NLP-SG
/

VideoLLaMA3-2B-Image

Visual Question Answering

videollama3_qwen2

text-generation

large-language-model

video-language-model

Model card Files Files and versions

Cyril666 commited on Jan 31, 2025

Commit

d0d941d

·

verified ·

1 Parent(s): 592bf39

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -90,7 +90,7 @@ conversation = [
         "role": "user",
         "content": [
             {"type": "image", "image": {"image_path": "https://github.com/DAMO-NLP-SG/VideoLLaMA3/blob/main/assets/sora.png?raw=true"}},
-            {"type": "text", "data": "What is the woman wearing?"},
         ]
     }
 ]

         "role": "user",
         "content": [
             {"type": "image", "image": {"image_path": "https://github.com/DAMO-NLP-SG/VideoLLaMA3/blob/main/assets/sora.png?raw=true"}},
+            {"type": "text", "text": "What is the woman wearing?"},
         ]
     }
 ]