Instructions to use OpenMOSS-Team/MOSS-VL-Instruct-0408 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenMOSS-Team/MOSS-VL-Instruct-0408 with Transformers:
    # Load model directly
    from transformers import AutoModel
    model = AutoModel.from_pretrained("OpenMOSS-Team/MOSS-VL-Instruct-0408", trust_remote_code=True, dtype="auto")
- Notebooks
- Google Colab
- Kaggle
Update README.md
README.md CHANGED
@@ -8,7 +8,7 @@ language:
 library_name: transformers
 pipeline_tag: video-text-to-text
 license: apache-2.0
-base_model: fnlp-vision/
+base_model: fnlp-vision/mossvl_base_0408
 tags:
 - SFT
 - Video-Understanding
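The change above makes `base_model` a complete `namespace/name` repo id. As a rough illustration of how such a field could be checked, here is a minimal sketch using only the standard library. The helper names `read_scalar` and `is_full_repo_id` and the `REPO_ID` pattern are hypothetical and are not the Hub's own validation logic; the front matter text is copied from the new side of the diff.

```python
import re

# Metadata lines from the updated README.md front matter, copied from the diff.
FRONT_MATTER = """\
library_name: transformers
pipeline_tag: video-text-to-text
license: apache-2.0
base_model: fnlp-vision/mossvl_base_0408
tags:
- SFT
- Video-Understanding
"""

# Hypothetical pattern for a "namespace/name" repo id (an assumption,
# not the Hub's official rule): both parts must be non-empty.
REPO_ID = re.compile(r"^[\w.-]+/[\w.-]+$")


def read_scalar(front_matter: str, key: str):
    """Return the value of a top-level `key: value` line, or None if absent."""
    for line in front_matter.splitlines():
        name, sep, value = line.partition(":")
        if sep and name.strip() == key:
            return value.strip()
    return None


def is_full_repo_id(value) -> bool:
    """True when value looks like `namespace/name` with both parts present."""
    return bool(value) and REPO_ID.match(value) is not None


base_model = read_scalar(FRONT_MATTER, "base_model")
print(base_model, is_full_repo_id(base_model))
print("fnlp-vision/", is_full_repo_id("fnlp-vision/"))
```

Under this sketch, the value on the removed line as it appears in the diff (`fnlp-vision/`) would be rejected because nothing follows the slash, while the value on the added line passes. A real check would more likely parse the front matter with a YAML library rather than the line scan used here.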