Visual Question Answering
Transformers
Safetensors
English
videollama2_mistral
text-generation
multimodal large language model
large video-language model
Instructions to use ccclemenfff/AVL with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ccclemenfff/AVL with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("visual-question-answering", model="ccclemenfff/AVL")# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("ccclemenfff/AVL", dtype="auto") - Notebooks
- Google Colab
- Kaggle
Commit ·
e65359a
1
Parent(s): 2e0bbea
Disable Flash Attention for CLIP
Browse files- handler.py +2 -0
handler.py
CHANGED
|
@@ -3,6 +3,8 @@ import base64
|
|
| 3 |
import tempfile
|
| 4 |
import os
|
| 5 |
import sys
|
|
|
|
|
|
|
| 6 |
|
| 7 |
# 确保能导入 videollama2 模块(模型代码需要放同目录或已安装)
|
| 8 |
sys.path.append('./')
|
|
|
|
| 3 |
import tempfile
|
| 4 |
import os
|
| 5 |
import sys
|
| 6 |
+
os.environ["FLASH_ATTENTION_2_ENABLED"] = "false"
|
| 7 |
+
|
| 8 |
|
| 9 |
# 确保能导入 videollama2 模块(模型代码需要放同目录或已安装)
|
| 10 |
sys.path.append('./')
|