Duplicated from DAMO-NLP-SG/VideoLLaMA2-7B-16F

ccclemenfff/AVL

Visual Question Answering

videollama2_mistral

text-generation

multimodal large language model

large video-language model

ccclemenfff committed on Jul 1, 2025

Commit e65359a · 1 Parent(s): 2e0bbea

Disable Flash Attention for CLIP

Files changed (1)

handler.py +2 -0

@@ -3,6 +3,8 @@ import base64
 import tempfile
 import os
 import sys
+os.environ["FLASH_ATTENTION_2_ENABLED"] = "false"
 # Ensure the videollama2 module can be imported (the model code must be in the same directory or already installed)
 sys.path.append('./')
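
The added line only has an effect if the flag is set before the code that reads it runs, which is presumably why it sits above the model imports. The diff does not show which code consumes `FLASH_ATTENTION_2_ENABLED`, so the following is a minimal sketch under assumptions: `flash_attention_enabled` and `pick_attn_implementation` are hypothetical helpers, and the accepted "false" spellings are illustrative. The backend names `"flash_attention_2"` and `"eager"` are values that Hugging Face transformers accepts for the `attn_implementation` argument of `from_pretrained`.

```python
import os

# Set the flag first, mirroring the commit: anything that reads it later
# (at import time or at model load) will see "false".
os.environ["FLASH_ATTENTION_2_ENABLED"] = "false"


def flash_attention_enabled(default: bool = True) -> bool:
    """Hypothetical helper: interpret the flag as a boolean.

    Assumption: "false", "0", "no" and "off" (case-insensitive) mean disabled.
    The real consumer of this variable is not shown in the diff.
    """
    raw = os.environ.get("FLASH_ATTENTION_2_ENABLED")
    if raw is None:
        return default
    return raw.strip().lower() not in {"false", "0", "no", "off"}


def pick_attn_implementation() -> str:
    """Hypothetical helper: map the flag to an attention backend name."""
    return "flash_attention_2" if flash_attention_enabled() else "eager"


print(pick_attn_implementation())  # prints "eager"
```

If the model code does not read this variable at all, a more direct option may be to pass `attn_implementation="eager"` when loading the affected component, assuming its loading path forwards that argument to `from_pretrained`.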