Video-Text-to-Text
Safetensors
qwen3_vl