rabbit commited on
Commit ยท
2adbcf5
1
Parent(s): bfda15f
update
Browse files
README.md
CHANGED
|
@@ -20,6 +20,7 @@ Doc2Bit-VL-7B is a vision-language model fine-tuned for document understanding.
|
|
| 20 |
A Vision-Language Model (VLM) specialized in **document understanding and information extraction**, supporting both **unstructured information** and **structured data (tables)** from document images.
|
| 21 |
|
| 22 |
This model is optimized for production usage via **vLLM serving** with an **OpenAI-compatible API**.
|
|
|
|
| 23 |
---
|
| 24 |
## ๐ Features
|
| 25 |
- Vision-Language Model for document images
|
|
@@ -32,6 +33,7 @@ This model is optimized for production usage via **vLLM serving** with an **Open
|
|
| 32 |
---
|
| 33 |
## ๐ Supported Data Types
|
| 34 |
### 1. Unstructured Information
|
|
|
|
| 35 |
Extract specific fields defined by the user, such as:
|
| 36 |
- Invoice number
|
| 37 |
- Date
|
|
@@ -41,6 +43,7 @@ Extract specific fields defined by the user, such as:
|
|
| 41 |
- Custom document attributes
|
| 42 |
---
|
| 43 |
### 2. Structured Table Data
|
|
|
|
| 44 |
Designed for extracting **individual columns** from tables, especially product tables.
|
| 45 |
Capabilities:
|
| 46 |
- Column-level extraction
|
|
|
|
| 20 |
A Vision-Language Model (VLM) specialized in **document understanding and information extraction**, supporting both **unstructured information** and **structured data (tables)** from document images.
|
| 21 |
|
| 22 |
This model is optimized for production usage via **vLLM serving** with an **OpenAI-compatible API**.
|
| 23 |
+
|
| 24 |
---
|
| 25 |
## ๐ Features
|
| 26 |
- Vision-Language Model for document images
|
|
|
|
| 33 |
---
|
| 34 |
## ๐ Supported Data Types
|
| 35 |
### 1. Unstructured Information
|
| 36 |
+
|
| 37 |
Extract specific fields defined by the user, such as:
|
| 38 |
- Invoice number
|
| 39 |
- Date
|
|
|
|
| 43 |
- Custom document attributes
|
| 44 |
---
|
| 45 |
### 2. Structured Table Data
|
| 46 |
+
|
| 47 |
Designed for extracting **individual columns** from tables, especially product tables.
|
| 48 |
Capabilities:
|
| 49 |
- Column-level extraction
|