File size: 811 Bytes
234d6ca
 
 
23c5705
 
234d6ca
1a34034
9a19607
 
aec4ac5
 
 
7e0b4d3
aec4ac5
7e0b4d3
aec4ac5
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
---
license: unknown
---
The purpose of this model is to classify a single document page image to define if it is the beginning page of the document or the middle/end page of a document.
Single-page documents are classified as beginning page. It is a first step of the more general document boundary classification problem.

To generate the embeddings use ```google/siglip2-so400m-patch16-512``` with no fine tuning.
You have a tiny script in generate_embeddings.py to generate a pickle file with the embeddings, provided a Pandas DataFrame ```tasks_df``` with a col ```"image_path"``` that contains all the images paths.

Then you can use the resulting embeddings with the model here uploaded.

Output meaning:

0 -> Middle or end page of a document

1 -> Beginning page of a document (or single-page document)