Instructions to use google/pix2struct-widget-captioning-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use google/pix2struct-widget-captioning-base with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("visual-question-answering", model="google/pix2struct-widget-captioning-base")# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("google/pix2struct-widget-captioning-base") model = AutoModelForImageTextToText.from_pretrained("google/pix2struct-widget-captioning-base") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -67,7 +67,7 @@ processor.push_to_hub("USERNAME/MODEL_NAME")
|
|
| 67 |
|
| 68 |
## Running the model
|
| 69 |
|
| 70 |
-
The instructions for running the model are exactly the same as the instructions stated on [`pix2struct-textcaps-base](https://huggingface.co/google/pix2struct-textcaps-base#using-the-model) model.
|
| 71 |
|
| 72 |
# Contribution
|
| 73 |
|
|
|
|
| 67 |
|
| 68 |
## Running the model
|
| 69 |
|
| 70 |
+
The instructions for running the model are exactly the same as the instructions stated on [`pix2struct-textcaps-base`](https://huggingface.co/google/pix2struct-textcaps-base#using-the-model) model.
|
| 71 |
|
| 72 |
# Contribution
|
| 73 |
|