R2CNN++: Multi-Dimensional Attention Based Rotation Invariant Detector with Robust Anchor Strategy.

作者: Xue Yang , Hao Sun , Zhi Guo , Jirui Yang , Tengfei Zhang

DOI:

关键词:

摘要: Object detection has been a building block in computer vision. Though considerable progress made, there still exist challenges for objects with small size, arbitrary direction, and dense distribution. Apart from natural images, such issues are especially pronounced aerial images of great importance. This paper presents novel multi-category rotation detector small, cluttered rotated objects, namely SCRDet. Specifically, sampling fusion network is devised which fuses multi-layer feature effective anchor sampling, to improve the sensitivity objects. Meanwhile, supervised pixel attention channel jointly explored object by suppressing noise highlighting feature. For more accurate estimation, IoU constant factor added smooth L1 loss address boundary problem rotating bounding box. Extensive experiments on two remote sensing public datasets DOTA, NWPU VHR-10 as well image COCO, VOC2007 scene text data ICDAR2015 show state-of-the-art performance our detector. The code models will be available at this https URL.

参考文章(27)
Ross Girshick, Fast R-CNN international conference on computer vision. pp. 1440- 1448 ,(2015) , 10.1109/ICCV.2015.169
Philipp Fischer, Thomas Brox, None, U-Net: Convolutional Networks for Biomedical Image Segmentation medical image computing and computer assisted intervention. pp. 234- 241 ,(2015) , 10.1007/978-3-319-24574-4_28
M Sai Praneeth, Xudong Peng, Alice Li, Shahrzad Hosseini Vajargah, Going deeper with convolutions computer vision and pattern recognition. pp. 1- 9 ,(2015) , 10.1109/CVPR.2015.7298594
Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik, Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation computer vision and pattern recognition. pp. 580- 587 ,(2014) , 10.1109/CVPR.2014.81
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 37, pp. 1904- 1916 ,(2015) , 10.1109/TPAMI.2015.2389824
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Deep Residual Learning for Image Recognition computer vision and pattern recognition. pp. 770- 778 ,(2016) , 10.1109/CVPR.2016.90
Jifeng Dai, Kaiming He, Jian Sun, Yi Li, R-FCN: Object Detection via Region-based Fully Convolutional Networks neural information processing systems. ,vol. 29, pp. 379- 387 ,(2016)
Gong Cheng, Peicheng Zhou, Junwei Han, Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images IEEE Transactions on Geoscience and Remote Sensing. ,vol. 54, pp. 7405- 7415 ,(2016) , 10.1109/TGRS.2016.2601622
Tsung-Yi Lin, Piotr Dollar, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie, Feature Pyramid Networks for Object Detection computer vision and pattern recognition. pp. 936- 944 ,(2017) , 10.1109/CVPR.2017.106
Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, Yichen Wei, Deformable Convolutional Networks international conference on computer vision. pp. 764- 773 ,(2017) , 10.1109/ICCV.2017.89