Transformers How to use jmonas/ViLT-33M-vqa with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("visual-question-answering", model="jmonas/ViLT-33M-vqa") # Load model directly
from transformers import AutoProcessor, AutoModelForVisualQuestionAnswering
processor = AutoProcessor.from_pretrained("jmonas/ViLT-33M-vqa")
model = AutoModelForVisualQuestionAnswering.from_pretrained("jmonas/ViLT-33M-vqa")