Data Description
Within the EntitySeg dataset downloaded from the official repo EntitySeg-Dataset, we offer both the high-resolution (original) images as well as their downsized, low-resolution counterparts. The term lr denotes these low-resolution versions. Comprehensive details about the dataset can be found in the files: entityv2_010203_entity_train.json and entityv2_010203_entity_val.json. Any additional annotations have been derived or processed based on these two primary annotation files.
For the annotation formats, we strictly follow the dataloader format of detectron2 in semantic, instance and panoptic segmentation. The entity segmentation annotations follow the COCO instance segmentation format.
βdetectron2
βββ ...
βββ projects
β βββCropFormer
β β βββ ...
βββ datasets
β βββ coco
β β βββ annotations
β β βββ train2017
β β βββ val2017
β βββ entityseg
β βββimages
β β βββentity_01_11580
β β βββentity_02_11598
β β βββentity_03_10049
β βββimages_lr
β β βββentity_01_11580
β β βββentity_02_11598
β β βββentity_03_10049
β βββ annotations
β β βββentity_segmentation
β β β βββentityv2_01_entity_train.json
β β β βββentityv2_02_entity_train.json
β β β βββentityv2_03_entity_train.json
β β β βββentityv2_010203_entity_train.json
β β β βββentityv2_010203_entity_train_lr.json
β β β βββentityv2_010203_entity_val.json
β β β βββentityv2_010203_entity_val_lr.json
β β βββinstance_segmentation
β β β βββentityv2_01_instances_train.json
β β β βββentityv2_01_instances_val.json
β β βββsemantic_segmentation
β β β βββsemantic_maps_train
β β β βββsemantic_maps_val
β β β βββtrain.txt
β β β βββval.txt
β β βββpanoptic_segmentation
β β β βββpanoptic_maps_train
β β β βββpanoptic_maps_val
β β β βββentityv2_01_panoptic_train.json
ββββββββββββββββββentityv2_01_panoptic_val.json
Low-Resolution Entity Annotations
For our low-resolution versions, we resize the original images and annotations such that the shortest side 800 pixels and the longest side 1333 pixels. While resizing, we employ bilinear interpolation for the images and nearest-neighbor interpolation for the annotations.
We resized the images primarily because evaluating high-resolution images demands significant memory, which may not be feasible for everyone. Instead, you can evaluate the segmentation model on the low-resolution images.
Instance Segmentation Annotations
We selected the top 150 categories for instance segmentation based on entity frequency.
Semantic Segmentation Annotations
We selected the top 150 categories for semantic segmentation based on pixel frequency.
Panoptic Segmentation Annotations
We selected the top 350 categories for panoptic segmentation based on entity frequency.
Code about Category Information
import mmcv
mmcv.load("entityv2_01_train.json")["categories"]