Video-Text-to-Text
Transformers
Safetensors
English
moss_vl
feature-extraction
SFT
Video-Understanding
Image-Understanding
MOSS-VL
OpenMOSS
multimodal
video
vision-language
custom_code
Instructions to use OpenMOSS-Team/MOSS-VL-Instruct-0408 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenMOSS-Team/MOSS-VL-Instruct-0408 with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("OpenMOSS-Team/MOSS-VL-Instruct-0408", trust_remote_code=True, dtype="auto")
- Notebooks
- Google Colab
- Kaggle
Upload folder using huggingface_hub
README.md CHANGED
@@ -99,11 +99,6 @@ conda activate moss_vl
 pip install -i https://pypi.org/simple --no-build-isolation -r requirements.txt
 ```
 
-Validated setup notes:
-
-- CUDA runtime used for validation: `12.8`
-- Inference loading uses `trust_remote_code=True` and `attn_implementation="flash_attention_2"`
-
 ### 🏃 Run Inference
 
 
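The setup note removed in this hunk said that inference loading uses `trust_remote_code=True` and `attn_implementation="flash_attention_2"`. For readers who still want that configuration, here is a minimal sketch. It assumes the `AutoModel` entry point and `dtype="auto"` from the model page snippet. `MODEL_ID`, `build_load_kwargs`, and `load_model` are hypothetical names introduced for illustration, not part of the repository.

```python
MODEL_ID = "OpenMOSS-Team/MOSS-VL-Instruct-0408"


def build_load_kwargs(use_flash_attention: bool = True) -> dict:
    """Hypothetical helper: collect the keyword arguments for from_pretrained."""
    # trust_remote_code is needed because the repo ships custom model code.
    kwargs = {"trust_remote_code": True, "dtype": "auto"}
    if use_flash_attention:
        # Setting quoted in the removed README note.
        kwargs["attn_implementation"] = "flash_attention_2"
    return kwargs


def load_model(model_id: str = MODEL_ID, use_flash_attention: bool = True):
    """Hypothetical helper: load the model with the kwargs built above."""
    # Imported lazily so the kwargs helper can be used without transformers installed.
    from transformers import AutoModel

    return AutoModel.from_pretrained(model_id, **build_load_kwargs(use_flash_attention))
```

Calling `load_model()` downloads the weights and runs the repository's custom code. `flash_attention_2` generally requires the `flash-attn` package and a supported GPU; passing `use_flash_attention=False` omits the setting and leaves the attention backend at the library default.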