How to use jmonas/ViLT-5M-vqa with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("visual-question-answering", model="jmonas/ViLT-5M-vqa")
# Load model directly from transformers import AutoProcessor, AutoModelForVisualQuestionAnswering processor = AutoProcessor.from_pretrained("jmonas/ViLT-5M-vqa") model = AutoModelForVisualQuestionAnswering.from_pretrained("jmonas/ViLT-5M-vqa")
The community tab is the place to discuss and collaborate with the HF community!