blesot
/

Mask-RCNN

Model card Files Files and versions

xet

Community

blesot commited on Aug 10, 2022

Commit

2e5bee0

1 Parent(s): a8aa345

Update README.md

Browse files

Files changed (1) hide show

README.md +11 -8

README.md CHANGED Viewed

@@ -1,13 +1,11 @@
 Hugging Face's logo
 ---
-license: apache-2.0
 tags:
 - object-detection
 - vision
 datasets:
 - coco
 ---
 # Mask R-CNN
@@ -16,13 +14,20 @@ datasets:
 Mask R-CNN is a model that extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition. The model locates pixels of images instead of just bounding boxes as Faster R-CNN was not designed for pixel-to-pixel alignment between network inputs and outputs.
 ### More information on the model and dataset:
-### The model
 Mask R-CNN works towards the approach of instance segmentation, which involves object detection, and semantic segmentation. For object detection, Mask R-CNN uses an architecture that is similar to Faster R-CNN, while it uses a Fully Convolutional Network(FCN) for semantic segmentation.
 The FCN is added to the top of features of a Faster R-CNN to generate a mask segmentation output. This segmentation output is in parallel with the classification and bounding box regressor network of the Faster R-CNN model. From the advancement of Fast R-CNN Region of Interest Pooling(ROI), Mask R-CNN adds refinement called ROI aligning by addressing the loss and misalignment of ROI Pooling; the new ROI aligned leads to improved results.
-### Technical Specifications
 Please [read the paper](https://arxiv.org/pdf/1703.06870.pdf) for more information on training.
@@ -37,14 +42,12 @@ The model architecture is divided into two parts:
 -  The network architectures utilized are called ResNet and ResNeXt. The depth can be either 50 or 101
 #### Results Summary
-- Instance Segmentation: Based on the COCO dataset, Mask R-CNN outperforms all categories compared to MNC and FCIS, which are state-of-the-art model.
 - Bounding Box Detection: Mask R-CNN outperforms the base variants of all previous state-of-the-art models, including the COCO 2016 Detection Challenge winner.
 ## Intended uses & limitations
-- With great generality, Mask RCNN can be extended to human pose estimation.
 ## Training Procedure

 Hugging Face's logo
 ---
 tags:
 - object-detection
 - vision
 datasets:
 - coco
 ---
 # Mask R-CNN
 Mask R-CNN is a model that extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition. The model locates pixels of images instead of just bounding boxes as Faster R-CNN was not designed for pixel-to-pixel alignment between network inputs and outputs.
+*This Model is based on the Pretrained model from [OpenMMlab](https://github.com/open-mmlab/mmdetection)*
+![MMDetection](https://user-images.githubusercontent.com/12907710/137271636-56ba1cd2-b110-4812-8221-b4c120320aa9.png)
 ### More information on the model and dataset:
+#### The model
 Mask R-CNN works towards the approach of instance segmentation, which involves object detection, and semantic segmentation. For object detection, Mask R-CNN uses an architecture that is similar to Faster R-CNN, while it uses a Fully Convolutional Network(FCN) for semantic segmentation.
 The FCN is added to the top of features of a Faster R-CNN to generate a mask segmentation output. This segmentation output is in parallel with the classification and bounding box regressor network of the Faster R-CNN model. From the advancement of Fast R-CNN Region of Interest Pooling(ROI), Mask R-CNN adds refinement called ROI aligning by addressing the loss and misalignment of ROI Pooling; the new ROI aligned leads to improved results.
+#### Datasets
+[COCO Datasets](https://cocodataset.org/#home)
+#### Technical Specifications
 Please [read the paper](https://arxiv.org/pdf/1703.06870.pdf) for more information on training.
 -  The network architectures utilized are called ResNet and ResNeXt. The depth can be either 50 or 101
 #### Results Summary
+- Instance Segmentation: Based on the COCO dataset, Mask R-CNN outperforms all categories compared to MNC and FCIS, which are state-of-the-art models.
 - Bounding Box Detection: Mask R-CNN outperforms the base variants of all previous state-of-the-art models, including the COCO 2016 Detection Challenge winner.
 ## Intended uses & limitations
+The identification of object relationships and the context of objects in a picture are both aided by image segmentation. Some of the applications include face recognition, number plate recognition, and satellite image analysis. With great model generality, Mask RCNN can be extended to human pose estimation; it can be used to estimate on-site approaching live traffic to aid autonomous driving
 ## Training Procedure