作者: Georgia Gkioxari , Piotr Dollár , Kaiming He , Ross Girshick
DOI:
关键词: Image (mathematics) 、 Segmentation 、 Object (computer science) 、 Computer vision 、 Object detection 、 Computer science 、 Task (computing) 、 Overhead (computing) 、 Minimum bounding box 、 Code (cryptography) 、 Artificial intelligence
摘要: We present a conceptually simple, flexible, and general framework for object instance segmentation. Our approach efficiently detects objects in an image while simultaneously generating high-quality segmentation mask each instance. The method, called Mask R-CNN, extends Faster R-CNN by adding branch predicting parallel with the existing bounding box recognition. is simple to train adds only small overhead running at 5 fps. Moreover, easy generalize other tasks, e.g., allowing us estimate human poses same framework. show top results all three tracks of COCO suite challenges, including segmentation, bounding-box detection, person keypoint detection. Without bells whistles, outperforms existing, single-model entries on every task, 2016 challenge winners. hope our effective will serve as solid baseline help ease future research instance-level Code has been made available at: this https URL