Reversible Recursive Instance-Level Object Segmentation

作者: Xiaodan Liang , Yunchao Wei , Xiaohui Shen , Zequn Jie , Jiashi Feng

DOI: 10.1109/CVPR.2016.75

关键词:

摘要: In this work, we propose a novel Reversible Recursive Instance-level Object Segmentation (R2-IOS) framework to address the challenging instance-level object segmentation task. R2-IOS consists of reversible proposal refinement sub-network that predicts bounding box offsets for refining locations, and an generates foreground mask dominant instance in each proposal. By being recursive, iteratively optimizes two subnetworks during joint training, which refined proposals improved predictions are alternately fed into other progressively increase network capabilities. reversible, adaptively determines optimal number iterations required both training testing. Furthermore, handle multiple overlapped instances within proposal, instance-aware denoising autoencoder is introduced distinguish from distracting instances. Extensive experiments on PASCAL VOC 2012 benchmark well demonstrate superiority over state-of-the-art methods. particular, APr 20 classes at 0:5 IoU achieves 66:7%, significantly outperforms results 58:7% by PFN [17] 46:3% [22].

参考文章(38)
C. Lawrence Zitnick, Piotr Dollár, Edge Boxes: Locating Object Proposals from Edges Computer Vision – ECCV 2014. pp. 391- 405 ,(2014) , 10.1007/978-3-319-10602-1_26
Nathan Silberman, David Sontag, Rob Fergus, Instance Segmentation of Indoor Scenes Using a Coverage Loss european conference on computer vision. pp. 616- 631 ,(2014) , 10.1007/978-3-319-10590-1_40
Ziyu Zhang, Alexander G. Schwing, Sanja Fidler, Raquel Urtasun, Monocular Object Instance Segmentation and Depth Ordering with CNNs 2015 IEEE International Conference on Computer Vision (ICCV). pp. 2614- 2622 ,(2015) , 10.1109/ICCV.2015.300
Ronan Collobert, Piotr Dollár, Pedro O. Pinheiro, Learning to segment object candidates neural information processing systems. ,vol. 28, pp. 1990- 1998 ,(2015)
Jifeng Dai, Kaiming He, Jian Sun, BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation international conference on computer vision. pp. 1635- 1643 ,(2015) , 10.1109/ICCV.2015.191
Ross Girshick, Jitendra Malik, Bharath Hariharan, Pablo Arbeláez, Simultaneous Detection and Segmentation european conference on computer vision. pp. 297- 312 ,(2014) , 10.1007/978-3-319-10584-0_20
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, Yoshua Bengio, None, Show, Attend and Tell: Neural Image Caption Generation with Visual Attention international conference on machine learning. ,vol. 3, pp. 2048- 2057 ,(2015)
Ross Girshick, Fast R-CNN international conference on computer vision. pp. 1440- 1448 ,(2015) , 10.1109/ICCV.2015.169
Si Liu, Xiaodan Liang, Luoqi Liu, Ke Lu, Liang Lin, Xiaochun Cao, Shuicheng Yan, Fashion Parsing With Video Context IEEE Transactions on Multimedia. ,vol. 17, pp. 1347- 1358 ,(2015) , 10.1109/TMM.2015.2443559
Yoshua Bengio, Tomas Mikolov, Razvan Pascanu, On the difficulty of training recurrent neural networks international conference on machine learning. pp. 1310- 1318 ,(2013)