Image-Text-to-Text
English
hpsv3
qwen2_5_vl