Instructions to use google/pix2struct-textcaps-large with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use google/pix2struct-textcaps-large with Transformers:
    # Use a pipeline as a high-level helper
    # Warning: Pipeline type "image-to-text" is no longer supported in transformers v5.
    # You must load the model directly (see below) or downgrade to v4.x with:
    #   pip install "transformers<5.0.0"
    from transformers import pipeline

    pipe = pipeline("image-to-text", model="google/pix2struct-textcaps-large")

    # Load model directly
    from transformers import AutoProcessor, AutoModelForImageTextToText

    processor = AutoProcessor.from_pretrained("google/pix2struct-textcaps-large")
    model = AutoModelForImageTextToText.from_pretrained("google/pix2struct-textcaps-large")
- Notebooks
- Google Colab
- Kaggle
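The "load model directly" snippet stops once the processor and model are loaded. A minimal sketch of how a caption might then be generated follows. It assumes PyTorch and Pillow are installed and reuses the `AutoProcessor` and `AutoModelForImageTextToText` classes named above. The helper name `caption_image`, its parameters, and the 50-token limit are illustrative choices, not part of the model card.

```python
# Hypothetical helper: caption a local image with the checkpoint named above.
MODEL_ID = "google/pix2struct-textcaps-large"


def caption_image(image_path, model_id=MODEL_ID, max_new_tokens=50):
    """Return a generated caption for the image stored at image_path."""
    # Heavy imports are deferred so that importing this module stays cheap.
    from PIL import Image
    from transformers import AutoProcessor, AutoModelForImageTextToText

    processor = AutoProcessor.from_pretrained(model_id)
    model = AutoModelForImageTextToText.from_pretrained(model_id)

    # Convert to RGB so grayscale or RGBA files are handled uniformly.
    image = Image.open(image_path).convert("RGB")

    # The processor turns the image into the tensors the model expects.
    inputs = processor(images=image, return_tensors="pt")

    # max_new_tokens=50 is an assumed limit; tune it for longer captions.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return processor.decode(output_ids[0], skip_special_tokens=True)
```

Calling `caption_image("photo.jpg")` downloads the checkpoint on first use, so expect a long first run; `photo.jpg` stands in for any local image file.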
Commit History
Update README.md 1015852
Update config.json bf3e5bf
Update config.json 179255a
Update README.md f49ed78
Update README.md 9cd4270
Update README.md 539cdbd
Create README.md 3c39097
Upload processor 89ddc73
Upload Pix2StructForConditionalGeneration 77ca957
initial commit c325b58
Younes Belkada committed on