Transformers How to use Video-R1/Qwen2.5-VL-7B-COT-SFT with Transformers:
# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText
processor = AutoProcessor.from_pretrained("Video-R1/Qwen2.5-VL-7B-COT-SFT")
model = AutoModelForImageTextToText.from_pretrained("Video-R1/Qwen2.5-VL-7B-COT-SFT")