How to use microsoft/git-large-vqav2 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("visual-question-answering", model="microsoft/git-large-vqav2")
# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("microsoft/git-large-vqav2") model = AutoModelForImageTextToText.from_pretrained("microsoft/git-large-vqav2")
“This checkpoint is "GIT-large", which is a smaller variant of GIT trained on 20 million image-text pairs. ” 这部分补充有误,这应该不是small的?
· Sign up or log in to comment