blesot commited on
Commit
2e5bee0
·
1 Parent(s): a8aa345

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -8
README.md CHANGED
@@ -1,13 +1,11 @@
1
  Hugging Face's logo
2
  ---
3
- license: apache-2.0
4
  tags:
5
  - object-detection
6
  - vision
7
  datasets:
8
  - coco
9
 
10
-
11
  ---
12
 
13
  # Mask R-CNN
@@ -16,13 +14,20 @@ datasets:
16
 
17
  Mask R-CNN is a model that extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition. The model locates pixels of images instead of just bounding boxes as Faster R-CNN was not designed for pixel-to-pixel alignment between network inputs and outputs.
18
 
 
 
 
 
19
  ### More information on the model and dataset:
20
 
21
- ### The model
22
  Mask R-CNN works towards the approach of instance segmentation, which involves object detection, and semantic segmentation. For object detection, Mask R-CNN uses an architecture that is similar to Faster R-CNN, while it uses a Fully Convolutional Network(FCN) for semantic segmentation.
23
  The FCN is added to the top of features of a Faster R-CNN to generate a mask segmentation output. This segmentation output is in parallel with the classification and bounding box regressor network of the Faster R-CNN model. From the advancement of Fast R-CNN Region of Interest Pooling(ROI), Mask R-CNN adds refinement called ROI aligning by addressing the loss and misalignment of ROI Pooling; the new ROI aligned leads to improved results.
24
 
25
- ### Technical Specifications
 
 
 
26
 
27
  Please [read the paper](https://arxiv.org/pdf/1703.06870.pdf) for more information on training.
28
 
@@ -37,14 +42,12 @@ The model architecture is divided into two parts:
37
  - The network architectures utilized are called ResNet and ResNeXt. The depth can be either 50 or 101
38
 
39
  #### Results Summary
40
- - Instance Segmentation: Based on the COCO dataset, Mask R-CNN outperforms all categories compared to MNC and FCIS, which are state-of-the-art model.
41
  - Bounding Box Detection: Mask R-CNN outperforms the base variants of all previous state-of-the-art models, including the COCO 2016 Detection Challenge winner.
42
 
43
 
44
  ## Intended uses & limitations
45
-
46
- - With great generality, Mask RCNN can be extended to human pose estimation.
47
-
48
 
49
 
50
  ## Training Procedure
 
1
  Hugging Face's logo
2
  ---
 
3
  tags:
4
  - object-detection
5
  - vision
6
  datasets:
7
  - coco
8
 
 
9
  ---
10
 
11
  # Mask R-CNN
 
14
 
15
  Mask R-CNN is a model that extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition. The model locates pixels of images instead of just bounding boxes as Faster R-CNN was not designed for pixel-to-pixel alignment between network inputs and outputs.
16
 
17
+ *This Model is based on the Pretrained model from [OpenMMlab](https://github.com/open-mmlab/mmdetection)*
18
+
19
+ ![MMDetection](https://user-images.githubusercontent.com/12907710/137271636-56ba1cd2-b110-4812-8221-b4c120320aa9.png)
20
+
21
  ### More information on the model and dataset:
22
 
23
+ #### The model
24
  Mask R-CNN works towards the approach of instance segmentation, which involves object detection, and semantic segmentation. For object detection, Mask R-CNN uses an architecture that is similar to Faster R-CNN, while it uses a Fully Convolutional Network(FCN) for semantic segmentation.
25
  The FCN is added to the top of features of a Faster R-CNN to generate a mask segmentation output. This segmentation output is in parallel with the classification and bounding box regressor network of the Faster R-CNN model. From the advancement of Fast R-CNN Region of Interest Pooling(ROI), Mask R-CNN adds refinement called ROI aligning by addressing the loss and misalignment of ROI Pooling; the new ROI aligned leads to improved results.
26
 
27
+ #### Datasets
28
+ [COCO Datasets](https://cocodataset.org/#home)
29
+
30
+ #### Technical Specifications
31
 
32
  Please [read the paper](https://arxiv.org/pdf/1703.06870.pdf) for more information on training.
33
 
 
42
  - The network architectures utilized are called ResNet and ResNeXt. The depth can be either 50 or 101
43
 
44
  #### Results Summary
45
+ - Instance Segmentation: Based on the COCO dataset, Mask R-CNN outperforms all categories compared to MNC and FCIS, which are state-of-the-art models.
46
  - Bounding Box Detection: Mask R-CNN outperforms the base variants of all previous state-of-the-art models, including the COCO 2016 Detection Challenge winner.
47
 
48
 
49
  ## Intended uses & limitations
50
+ The identification of object relationships and the context of objects in a picture are both aided by image segmentation. Some of the applications include face recognition, number plate recognition, and satellite image analysis. With great model generality, Mask RCNN can be extended to human pose estimation; it can be used to estimate on-site approaching live traffic to aid autonomous driving
 
 
51
 
52
 
53
  ## Training Procedure