Caraaaaa/text_image_captioning


Caraaaaa committed on Jan 20, 2024

Commit 43f378e (verified) · 1 Parent(s): 69869a8

Update README.md

Files changed (1)

README.md +14 -1


@@ -2,4 +2,17 @@
 datasets:
 - Caraaaaa/non_text_image_captioning
 pipeline_tag: image-to-text
----

+---
+This model is fine-tuned on non-text images extracted from documents (e.g., PDFs). It analyzes the content of an image and produces a descriptive caption.
+It is part of a project to build a software solution that processes offline documents (PDF, Word, PowerPoint, etc.) and detects WCAG accessibility issues.
+Example document with non-text images:
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/64b539ab4dd3e248953a6e69/IlcbNsHuzK5JHHixh_dwN.png)
+Extracted Image:
+![Extracted image](https://datasets-server.huggingface.co/assets/Caraaaaa/non_text_image_captioning/--/ca73cb435a60096ff7194f9616a54fde01f69039/--/default/train/10/image/image.jpg)
+Generated caption:
+"Indication of correct signature"
+- [Training script](https://colab.research.google.com/drive/1QYvXdi0V1AXqlBMR8MpyydNMnK_Vt4dU?usp=sharing)
+- [GitHub repo](https://github.com/caraaaaa/doc_accessibility?tab=readme-ov-file)
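
The README describes what the model does but does not show how to call it. The following is a minimal sketch, not the author's documented usage. It assumes the checkpoint loads through the `transformers` `image-to-text` pipeline (suggested by the `pipeline_tag` in the metadata, but not confirmed by the README) and that the installed `transformers` version still provides that task. The helper names `load_captioner` and `caption_image` are hypothetical.

```python
from typing import Any, Callable, List

# Repo id taken from this page; whether it loads via `pipeline` is an assumption.
MODEL_ID = "Caraaaaa/text_image_captioning"


def load_captioner(model_id: str = MODEL_ID) -> Callable[[Any], List[dict]]:
    """Build an image-to-text pipeline for the given checkpoint."""
    # Imported lazily so caption_image() can be used without transformers installed.
    from transformers import pipeline

    return pipeline("image-to-text", model=model_id)


def caption_image(captioner: Callable[[Any], List[dict]], image: Any) -> str:
    """Return the first generated caption for an image path, URL, or PIL image."""
    outputs = captioner(image)
    if not outputs:
        return ""
    # image-to-text pipelines return a list of dicts keyed by "generated_text".
    return outputs[0]["generated_text"].strip()
```

Under these assumptions, `caption_image(load_captioner(), "extracted_image.jpg")` would return a caption string for an image extracted from a document, where `extracted_image.jpg` is a placeholder file name. Passing the captioner in as an argument keeps the helper easy to check with a stub, without downloading the model.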