Update README.md
Browse files
README.md
CHANGED
|
@@ -24,7 +24,7 @@ license: apache-2.0
|
|
| 24 |
|
| 25 |
## Model description
|
| 26 |
|
| 27 |
-
DocExplainerV0 is a **first-step approach** to Visual Document Question Answering
|
| 28 |
Unlike standard VLMs that only provide text-based answers, DocExplainerV0 adds **visual evidence through bounding boxes**, making model predictions more interpretable.
|
| 29 |
It is designed as a **plug-and-play module** to be combined with existing Vision-Language Models (VLMs), decoupling answer generation from spatial grounding.
|
| 30 |
|
|
|
|
| 24 |
|
| 25 |
## Model description
|
| 26 |
|
| 27 |
+
DocExplainerV0 is a **first-step approach** to Visual Document Question Answering with bounding box localization.
|
| 28 |
Unlike standard VLMs that only provide text-based answers, DocExplainerV0 adds **visual evidence through bounding boxes**, making model predictions more interpretable.
|
| 29 |
It is designed as a **plug-and-play module** to be combined with existing Vision-Language Models (VLMs), decoupling answer generation from spatial grounding.
|
| 30 |
|