Instructions to use google/pix2struct-ai2d-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use google/pix2struct-ai2d-base with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("visual-question-answering", model="google/pix2struct-ai2d-base")# Load model directly from transformers import AutoProcessor, AutoModelForMultimodalLM processor = AutoProcessor.from_pretrained("google/pix2struct-ai2d-base") model = AutoModelForMultimodalLM.from_pretrained("google/pix2struct-ai2d-base") - Notebooks
- Google Colab
- Kaggle
Update preprocessor_config.json
Browse files- preprocessor_config.json +1 -0
preprocessor_config.json
CHANGED
|
@@ -3,6 +3,7 @@
|
|
| 3 |
"do_normalize": true,
|
| 4 |
"image_processor_type": "Pix2StructImageProcessor",
|
| 5 |
"max_patches": 2048,
|
|
|
|
| 6 |
"patch_size": {
|
| 7 |
"height": 16,
|
| 8 |
"width": 16
|
|
|
|
| 3 |
"do_normalize": true,
|
| 4 |
"image_processor_type": "Pix2StructImageProcessor",
|
| 5 |
"max_patches": 2048,
|
| 6 |
+
"is_vqa": true,
|
| 7 |
"patch_size": {
|
| 8 |
"height": 16,
|
| 9 |
"width": 16
|