Transformers How to use OpenGVLab/VideoChat-R1_7B_caption with Transformers:
# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText
processor = AutoProcessor.from_pretrained("OpenGVLab/VideoChat-R1_7B_caption")
model = AutoModelForImageTextToText.from_pretrained("OpenGVLab/VideoChat-R1_7B_caption")