Video-Text-to-Text
Transformers
Safetensors
English
moss_vl
feature-extraction
Base
Video-Understanding
Image-Understanding
MOSS-VL
OpenMOSS
multimodal
video
vision-language
custom_code
Instructions to use OpenMOSS-Team/MOSS-VL-Base-0408 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenMOSS-Team/MOSS-VL-Base-0408 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenMOSS-Team/MOSS-VL-Base-0408", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Upload logo.png
Browse files- .gitattributes +1 -0
- logo.png +3 -0
.gitattributes
CHANGED
|
@@ -39,3 +39,4 @@ assets/structure.png filter=lfs diff=lfs merge=lfs -text
|
|
| 39 |
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
| 40 |
assets/benchmark_table.png filter=lfs diff=lfs merge=lfs -text
|
| 41 |
assets/MOSS-VL-benchmark.png filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 39 |
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
| 40 |
assets/benchmark_table.png filter=lfs diff=lfs merge=lfs -text
|
| 41 |
assets/MOSS-VL-benchmark.png filter=lfs diff=lfs merge=lfs -text
|
| 42 |
+
logo.png filter=lfs diff=lfs merge=lfs -text
|
logo.png
ADDED
|
Git LFS Details
|