Video-Text-to-Text
Safetensors
pixel_qwen2_5_vl