UI_Screen_Description_Generator_with_Pix2Struct

pankti0919 committed on Apr 29, 2025

Commit c41fe9c (verified), 1 parent: 981337a

Update README.md

Files changed (1)

README.md

@@ -12,3 +12,19 @@ short_description: Built a vision-language application
 ---
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

+# UI Screen Describer with Pix2Struct
+This demo uses Google's `pix2struct-screen2words-large` model to turn UI screenshots into natural language descriptions.
+### Use Cases
+- Accessibility
+- UI testing
+- Auto documentation
+### How it works
+Upload any screenshot (e.g., app, webpage, dashboard) and the model will describe it in text.
+Built using Hugging Face Transformers + Gradio.
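
The flow the README describes (upload a screenshot, get a text description, built on Transformers and Gradio) might look roughly like the sketch below. This is not the Space's actual app code, which this commit does not show. The hub ID `google/pix2struct-screen2words-large`, the helper names `load_model`, `describe_screenshot`, and `build_demo`, and the `max_new_tokens=50` limit are all assumptions made for illustration.

```python
# Hypothetical sketch of a Pix2Struct + Gradio captioning app.
# Names and defaults here are illustrative assumptions, not the Space's code.

# Assumed hub ID, derived from the model name given in the README.
MODEL_ID = "google/pix2struct-screen2words-large"


def load_model(model_id: str = MODEL_ID):
    """Load the processor and model (downloads weights on first use)."""
    # Imported lazily so the helpers below can be used without transformers installed.
    from transformers import (
        Pix2StructForConditionalGeneration,
        Pix2StructProcessor,
    )

    processor = Pix2StructProcessor.from_pretrained(model_id)
    model = Pix2StructForConditionalGeneration.from_pretrained(model_id)
    return processor, model


def describe_screenshot(image, processor, model, max_new_tokens: int = 50) -> str:
    """Return a text description of one screenshot (a PIL image)."""
    # The processor converts the image into the patch tensors the model expects.
    inputs = processor(images=image, return_tensors="pt")
    generated_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Decode token IDs back to text and take the single result.
    text = processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
    return text.strip()


def build_demo():
    """Wire the captioning function into a simple Gradio interface."""
    import gradio as gr

    processor, model = load_model()
    return gr.Interface(
        fn=lambda img: describe_screenshot(img, processor, model),
        inputs=gr.Image(type="pil", label="UI screenshot"),
        outputs=gr.Textbox(label="Description"),
        title="UI Screen Describer with Pix2Struct",
    )
```

Calling `build_demo().launch()` would start the app locally. Passing the processor and model into `describe_screenshot` as arguments, rather than loading them inside it, keeps the model loaded once per process and makes the function easy to exercise with stand-in objects.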