Video-Text-to-Text
Transformers
Safetensors
English
moss_vl
feature-extraction
SFT
Video-Understanding
Image-Understanding
MOSS-VL
OpenMOSS
multimodal
video
vision-language
custom_code
Instructions to use OpenMOSS-Team/MOSS-VL-Instruct-0408 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenMOSS-Team/MOSS-VL-Instruct-0408 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenMOSS-Team/MOSS-VL-Instruct-0408", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -294,7 +294,7 @@ MOSS-VL-Instruct-0408 represents an early milestone in the MOSS-VL roadmap, and
|
|
| 294 |
## 📜 Citation
|
| 295 |
```bibtex
|
| 296 |
@misc{moss_vl_2026,
|
| 297 |
-
title = {
|
| 298 |
author = {OpenMOSS Team},
|
| 299 |
year = {2026},
|
| 300 |
howpublished = {\url{https://github.com/OpenMOSS/MOSS-VL}},
|
|
|
|
| 294 |
## 📜 Citation
|
| 295 |
```bibtex
|
| 296 |
@misc{moss_vl_2026,
|
| 297 |
+
title = {MOSS-VL Technical Report},
|
| 298 |
author = {OpenMOSS Team},
|
| 299 |
year = {2026},
|
| 300 |
howpublished = {\url{https://github.com/OpenMOSS/MOSS-VL}},
|