Instructions to use Salesforce/blip-vqa-capfilt-large with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Salesforce/blip-vqa-capfilt-large with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("visual-question-answering", model="Salesforce/blip-vqa-capfilt-large")# Load model directly from transformers import AutoProcessor, AutoModelForVisualQuestionAnswering processor = AutoProcessor.from_pretrained("Salesforce/blip-vqa-capfilt-large") model = AutoModelForVisualQuestionAnswering.from_pretrained("Salesforce/blip-vqa-capfilt-large") - Notebooks
- Google Colab
- Kaggle
Update README.md
#2
by ybelkada - opened
README.md
CHANGED
|
@@ -10,7 +10,7 @@ license: bsd-3-clause
|
|
| 10 |
|
| 11 |
# BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
|
| 12 |
|
| 13 |
-
Model card for BLIP trained on visual question answering -
|
| 14 |
|
| 15 |
|  |
|
| 16 |
|:--:|
|
|
|
|
| 10 |
|
| 11 |
# BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
|
| 12 |
|
| 13 |
+
Model card for BLIP trained on visual question answering - large architecture (with ViT large backbone).
|
| 14 |
|
| 15 |
|  |
|
| 16 |
|:--:|
|