Spaces:
Running
on
Zero
Running
on
Zero
| title: UI Screen Description Generator With Pix2Struct | |
| emoji: 🐨 | |
| colorFrom: purple | |
| colorTo: blue | |
| sdk: gradio | |
| sdk_version: 5.28.0 | |
| app_file: app.py | |
| pinned: false | |
| license: mit | |
| short_description: Built a vision-language application | |
| Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference | |
| # UI Screen Describer with Pix2Struct | |
| This demo uses Google's `pix2struct-screen2words-large` model to turn UI screenshots into natural language descriptions. | |
| ### Use Cases | |
| - Accessibility | |
| - UI testing | |
| - Auto documentation | |
| ### How it works | |
| Upload any screenshot (e.g., app, webpage, dashboard) and the model will describe it in text. | |
| Built using Hugging Face Transformers + Gradio. | |