Update README.md
README.md
|
@@ -26,9 +26,29 @@ TF-ID models take an image of a single paper page as the input, and return bound
|
|
| 26 |
TF-ID-base and TF-ID-large draw bounding boxes around tables/figures and their caption text.
|
| 27 |
TF-ID-base-no-caption and TF-ID-large-no-caption draw bounding boxes around tables/figures without their caption text.
|
| 28 |
|
| 29 |
Object Detection results format:
|
| 30 |
{'\<OD>': {'bboxes': [[x1, y1, x2, y2], ...],
|
| 31 |
'labels': ['label1', 'label2', ...]} }
|
| 32 |
|
| 33 |
## How to Get Started with the Model
|
| 34 |
|
|
|
|
| 26 |
TF-ID-base and TF-ID-large draw bounding boxes around tables/figures and their caption text.
|
| 27 |
TF-ID-base-no-caption and TF-ID-large-no-caption draw bounding boxes around tables/figures without their caption text.
|
| 28 |
|
| 29 |
+

|
| 30 |
+
|
| 31 |
Object Detection results format:
|
| 32 |
{'\<OD>': {'bboxes': [[x1, y1, x2, y2], ...],
|
| 33 |
'labels': ['label1', 'label2', ...]} }
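A minimal sketch of how the result format above could be consumed, pairing each label with its bounding box. It assumes the key is the literal string `<OD>` (the backslash in the README is treated as a Markdown escape). The helper name `pair_boxes_with_labels`, the sample coordinates, and the label strings are hypothetical and used only for illustration.

```python
from typing import List, Tuple


def pair_boxes_with_labels(
    result: dict, task: str = "<OD>"
) -> List[Tuple[str, List[float]]]:
    """Pair each label with its [x1, y1, x2, y2] box from an OD result dict.

    Assumes `result` follows the documented shape:
    {task: {'bboxes': [[x1, y1, x2, y2], ...], 'labels': [...]}}
    """
    detections = result[task]
    bboxes = detections["bboxes"]
    labels = detections["labels"]
    # The two lists are expected to be parallel; fail loudly if they are not.
    if len(bboxes) != len(labels):
        raise ValueError("'bboxes' and 'labels' must have the same length")
    return list(zip(labels, bboxes))


# Hypothetical sample output; real coordinates and labels come from the model.
example = {
    "<OD>": {
        "bboxes": [[10.0, 20.0, 110.0, 220.0], [30.0, 250.0, 400.0, 300.0]],
        "labels": ["figure", "table"],
    }
}

pairs = pair_boxes_with_labels(example)
for label, (x1, y1, x2, y2) in pairs:
    print(f"{label}: ({x1}, {y1}) to ({x2}, {y2})")
```

Keeping the pairing in one small helper makes it easy to add filtering (for example, by label) before cropping regions from the page image.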
|
| 34 |
+
|
| 35 |
+
## Benchmarks
|
| 36 |
+
|
| 37 |
+
We tested the models on paper pages outside the training dataset. The papers are a subset of Hugging Face Daily Papers.
|
| 38 |
+
|
| 39 |
+
Correct output: the model draws correct bounding boxes for every table/figure on the given page.
|
| 40 |
+
|
| 41 |
+
| Model | Total Images | Correct Output | Success Rate |
|
| 42 |
+
|---------------------------------------------------------------|--------------|----------------|--------------|
|
| 43 |
+
| TF-ID-base[[HF]](https://huggingface.co/yifeihu/TF-ID-base) | 258 | 251 | 97.29% |
|
| 44 |
+
| TF-ID-large[[HF]](https://huggingface.co/yifeihu/TF-ID-large) | 258 | 253 | 98.06% |
|
| 45 |
+
|
| 46 |
+
| Model | Total Images | Correct Output | Success Rate |
|
| 47 |
+
|---------------------------------------------------------------|--------------|----------------|--------------|
|
| 48 |
+
| TF-ID-base-no-caption[[HF]](https://huggingface.co/yifeihu/TF-ID-base-no-caption) | 261 | 253 | 96.93% |
|
| 49 |
+
| TF-ID-large-no-caption[[HF]](https://huggingface.co/yifeihu/TF-ID-large-no-caption) | 261 | 254 | 97.32% |
|
| 50 |
+
|
| 51 |
+
Depending on the use case, some "incorrect" outputs may still be usable. For example, the model may draw two bounding boxes for one figure with two child components.
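The success rates in the benchmark tables are the share of pages with a correct output, expressed as a percentage. As a quick sketch, the figures can be recomputed from the reported counts. The function name `success_rate` is hypothetical; the counts are copied from the tables.

```python
def success_rate(correct: int, total: int) -> float:
    """Return the percentage of correct outputs, rounded to two decimals."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100.0 * correct / total, 2)


# (correct outputs, total images) as reported in the benchmark tables.
reported = {
    "TF-ID-base": (251, 258),
    "TF-ID-large": (253, 258),
    "TF-ID-base-no-caption": (253, 261),
    "TF-ID-large-no-caption": (254, 261),
}

rates = {name: success_rate(c, t) for name, (c, t) in reported.items()}
for name, rate in rates.items():
    print(f"{name}: {rate:.2f}%")
```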
|
| 52 |
|
| 53 |
## How to Get Started with the Model
|
| 54 |
|