drozdgk's picture
chore: vendor third_party (remove submodules, ignore artifacts)
352cafd
|
raw
history blame
3.6 kB
## Data Description
Within the EntitySeg dataset downloaded from the official repo [EntitySeg-Dataset](https://github.com/adobe-research/EntitySeg-Dataset/releases/tag/v1.0), we offer both the high-resolution (original) images as well as their downsized, low-resolution counterparts. The term `lr` denotes these low-resolution versions. Comprehensive details about the dataset can be found in the files: `entityv2_010203_entity_train.json` and `entityv2_010203_entity_val.json`. Any additional annotations have been derived or processed based on these two primary annotation files.
For the annotation formats, we strictly follow the dataloader format of detectron2 in semantic, instance and panoptic segmentation. The entity segmentation annotations follow the COCO instance segmentation format.
```
β”œdetectron2
β”œβ”€β”€ ...
β”œβ”€β”€ projects
β”‚ β”œβ”€β”€CropFormer
β”‚ β”‚ β”œβ”€β”€ ...
β”œβ”€β”€ datasets
β”‚ β”œβ”€β”€ coco
β”‚ β”‚ β”œβ”€β”€ annotations
β”‚ β”‚ β”œβ”€β”€ train2017
β”‚ β”‚ └── val2017
β”‚ └── entityseg
β”‚ β”œβ”€β”€images
β”‚ β”‚ β”œβ”€β”€entity_01_11580
β”‚ β”‚ β”œβ”€β”€entity_02_11598
β”‚ β”‚ └──entity_03_10049
β”‚ │──images_lr
β”‚ β”‚ β”œβ”€β”€entity_01_11580
β”‚ β”‚ β”œβ”€β”€entity_02_11598
β”‚ β”‚ └──entity_03_10049
β”‚ β”œβ”€β”€ annotations
β”‚ β”‚ β”œβ”€β”€entity_segmentation
β”‚ β”‚ β”‚ β”œβ”€β”€entityv2_01_entity_train.json
β”‚ β”‚ β”‚ β”œβ”€β”€entityv2_02_entity_train.json
β”‚ β”‚ β”‚ β”œβ”€β”€entityv2_03_entity_train.json
β”‚ β”‚ β”‚ β”œβ”€β”€entityv2_010203_entity_train.json
β”‚ β”‚ β”‚ β”œβ”€β”€entityv2_010203_entity_train_lr.json
β”‚ β”‚ β”‚ β”œβ”€β”€entityv2_010203_entity_val.json
β”‚ β”‚ β”‚ └──entityv2_010203_entity_val_lr.json
β”‚ β”‚ β”œβ”€β”€instance_segmentation
β”‚ β”‚ β”‚ β”œβ”€β”€entityv2_01_instances_train.json
β”‚ β”‚ β”‚ └──entityv2_01_instances_val.json
β”‚ β”‚ β”œβ”€β”€semantic_segmentation
β”‚ β”‚ β”‚ β”œβ”€β”€semantic_maps_train
β”‚ β”‚ β”‚ β”œβ”€β”€semantic_maps_val
β”‚ β”‚ β”‚ β”œβ”€β”€train.txt
β”‚ β”‚ β”‚ └──val.txt
β”‚ β”‚ β”œβ”€β”€panoptic_segmentation
β”‚ β”‚ β”‚ β”œβ”€β”€panoptic_maps_train
β”‚ β”‚ β”‚ β”œβ”€β”€panoptic_maps_val
β”‚ β”‚ β”‚ β”œβ”€β”€entityv2_01_panoptic_train.json
β”œβ”€β”€β”€β”€β”€β”€β”€β””β”€β”€β”€β””β”€β”€β””β”€β”€entityv2_01_panoptic_val.json
```
#### Low-Resolution Entity Annotations
For our low-resolution versions, we resize the original images and annotations such that the shortest side 800 pixels and the longest side 1333 pixels. While resizing, we employ bilinear interpolation for the images and nearest-neighbor interpolation for the annotations.
We resized the images primarily because evaluating high-resolution images demands significant memory, which may not be feasible for everyone. Instead, you can evaluate the segmentation model on the low-resolution images.
#### Instance Segmentation Annotations
We selected the top 150 categories for instance segmentation based on entity frequency.
#### Semantic Segmentation Annotations
We selected the top 150 categories for semantic segmentation based on pixel frequency.
#### Panoptic Segmentation Annotations
We selected the top 350 categories for panoptic segmentation based on entity frequency.
#### Code about Category Information
```
import mmcv
mmcv.load("entityv2_01_train.json")["categories"]
```