| ## Data Description | |
| Within the EntitySeg dataset downloaded from the official repo [EntitySeg-Dataset](https://github.com/adobe-research/EntitySeg-Dataset/releases/tag/v1.0), we offer both the high-resolution (original) images as well as their downsized, low-resolution counterparts. The term `lr` denotes these low-resolution versions. Comprehensive details about the dataset can be found in the files: `entityv2_010203_entity_train.json` and `entityv2_010203_entity_val.json`. Any additional annotations have been derived or processed based on these two primary annotation files. | |
| For the annotation formats, we strictly follow the dataloader format of detectron2 in semantic, instance and panoptic segmentation. The entity segmentation annotations follow the COCO instance segmentation format. | |
| ``` | |
| βdetectron2 | |
| βββ ... | |
| βββ projects | |
| β βββCropFormer | |
| β β βββ ... | |
| βββ datasets | |
| β βββ coco | |
| β β βββ annotations | |
| β β βββ train2017 | |
| β β βββ val2017 | |
| β βββ entityseg | |
| β βββimages | |
| β β βββentity_01_11580 | |
| β β βββentity_02_11598 | |
| β β βββentity_03_10049 | |
| β βββimages_lr | |
| β β βββentity_01_11580 | |
| β β βββentity_02_11598 | |
| β β βββentity_03_10049 | |
| β βββ annotations | |
| β β βββentity_segmentation | |
| β β β βββentityv2_01_entity_train.json | |
| β β β βββentityv2_02_entity_train.json | |
| β β β βββentityv2_03_entity_train.json | |
| β β β βββentityv2_010203_entity_train.json | |
| β β β βββentityv2_010203_entity_train_lr.json | |
| β β β βββentityv2_010203_entity_val.json | |
| β β β βββentityv2_010203_entity_val_lr.json | |
| β β βββinstance_segmentation | |
| β β β βββentityv2_01_instances_train.json | |
| β β β βββentityv2_01_instances_val.json | |
| β β βββsemantic_segmentation | |
| β β β βββsemantic_maps_train | |
| β β β βββsemantic_maps_val | |
| β β β βββtrain.txt | |
| β β β βββval.txt | |
| β β βββpanoptic_segmentation | |
| β β β βββpanoptic_maps_train | |
| β β β βββpanoptic_maps_val | |
| β β β βββentityv2_01_panoptic_train.json | |
| ββββββββββββββββββentityv2_01_panoptic_val.json | |
| ``` | |
| #### Low-Resolution Entity Annotations | |
| For our low-resolution versions, we resize the original images and annotations such that the shortest side 800 pixels and the longest side 1333 pixels. While resizing, we employ bilinear interpolation for the images and nearest-neighbor interpolation for the annotations. | |
| We resized the images primarily because evaluating high-resolution images demands significant memory, which may not be feasible for everyone. Instead, you can evaluate the segmentation model on the low-resolution images. | |
| #### Instance Segmentation Annotations | |
| We selected the top 150 categories for instance segmentation based on entity frequency. | |
| #### Semantic Segmentation Annotations | |
| We selected the top 150 categories for semantic segmentation based on pixel frequency. | |
| #### Panoptic Segmentation Annotations | |
| We selected the top 350 categories for panoptic segmentation based on entity frequency. | |
| #### Code about Category Information | |
| ``` | |
| import mmcv | |
| mmcv.load("entityv2_01_train.json")["categories"] | |
| ``` |