How to use csarron/vilt-vqa2-ft with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("visual-question-answering", model="csarron/vilt-vqa2-ft")
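Once the pipeline is built, it can be called with an image and a question. The sketch below wraps that call in a small helper. The helper name `ask` and its `min_score` parameter are illustrative and not part of Transformers. It assumes the pipeline returns a list of dicts with `score` and `answer` keys, which is the documented output of the visual-question-answering pipeline.

```python
def ask(pipe, image, question, top_k=3, min_score=0.0):
    """Query a visual-question-answering pipeline (hypothetical helper).

    `image` may be anything the pipeline accepts, such as a PIL image,
    a local file path, or a URL. Returns (answer, score) pairs sorted
    from most to least likely, dropping entries below `min_score`.
    """
    results = pipe(image=image, question=question, top_k=top_k)
    ranked = sorted(results, key=lambda r: r["score"], reverse=True)
    return [(r["answer"], r["score"]) for r in ranked if r["score"] >= min_score]
```

With the `pipe` object from above, a call would look like `ask(pipe, "photo.jpg", "What is on the table?")`, where `photo.jpg` is a placeholder path.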
# Load model directly
from transformers import AutoProcessor, AutoModelForVisualQuestionAnswering

processor = AutoProcessor.from_pretrained("csarron/vilt-vqa2-ft")
model = AutoModelForVisualQuestionAnswering.from_pretrained("csarron/vilt-vqa2-ft")
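When the model is loaded directly, the raw output is a vector of logits, one per candidate answer, which still has to be mapped to labels. The following is a minimal sketch of that last step in plain Python. The function name `top_answers` is hypothetical, and the use of a per-answer sigmoid is an assumption based on how ViLT VQA heads are usually scored. The comments show one way the logits might be obtained from `processor` and `model`.

```python
import math


def _sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def top_answers(logits, id2label, k=3):
    """Return the k highest-scoring (label, score) pairs (hypothetical helper).

    `logits` is a flat list of floats and `id2label` maps an index to
    an answer string, as in `model.config.id2label`.
    """
    scores = [_sigmoid(x) for x in logits]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [(id2label[i], scores[i]) for i in ranked[:k]]


# Assumed usage with the objects loaded above (requires PyTorch and an image):
#   inputs = processor(image, question, return_tensors="pt")
#   logits = model(**inputs).logits[0].tolist()
#   print(top_answers(logits, model.config.id2label))
```

Taking only the first entry of the result is equivalent to an argmax over the logits, since the sigmoid preserves ordering.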