Video-Text-to-Text
Transformers
Safetensors
English
multimodal
video
vision-language
mllama
streaming
realtime
low-latency
Instructions to use OpenMOSS-Team/moss-video-preview-realtime-sft with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenMOSS-Team/moss-video-preview-realtime-sft with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenMOSS-Team/moss-video-preview-realtime-sft", dtype="auto") - Notebooks
- Google Colab
- Kaggle