Toward Fast and Accurate Vehicle Detection in Aerial Images Using Coupled Region-Based Convolutional Neural Networks

作者: Zhipeng Deng , Hao Sun , Shilin Zhou , Juanping Zhao , Huanxin Zou

DOI: 10.1109/JSTARS.2017.2694890

关键词:

摘要: Vehicle detection in aerial images, being an interesting but challenging problem, plays important role for a wide range of applications. Traditional methods are based on sliding-window search and handcrafted or shallow-learning-based features with heavy computational costs limited representation power. Recently, deep learning algorithms, especially region-based convolutional neural networks (R-CNNs), have achieved state-of-the-art performance computer vision. However, several challenges limit the applications R-CNNs vehicle from images: 1) vehicles large-scale images relatively small size, poor localization objects; 2) particularly designed detecting bounding box targets without extracting attributes; 3) manual annotation is generally expensive available training not sufficient number. To address these problems, this paper proposes fast accurate framework. On one hand, to accurately extract vehicle-like targets, we developed accurate-vehicle-proposal-network (AVPN) hyper feature map which combines hierarchical maps that more object detection. other propose coupled R-CNN method, AVPN attribute network vehicle's location attributes simultaneously. For original annotations, use cropped image blocks data augmentation avoid overfitting. Comprehensive evaluations public Munich dataset collected demonstrate accuracy effectiveness proposed method.

参考文章(51)
Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 39, pp. 1137- 1149 ,(2017) , 10.1109/TPAMI.2016.2577031
Ross Girshick, Fast R-CNN international conference on computer vision. pp. 1440- 1448 ,(2015) , 10.1109/ICCV.2015.169
Kang Liu, Gellert Mattyus, Fast Multiclass Vehicle Detection on Aerial Images IEEE Geoscience and Remote Sensing Letters. ,vol. 12, pp. 1938- 1942 ,(2015) , 10.1109/LGRS.2015.2439517
Matthew D. Zeiler, Rob Fergus, Visualizing and Understanding Convolutional Networks european conference on computer vision. pp. 818- 833 ,(2014) , 10.1007/978-3-319-10590-1_53
Jonathan Long, Evan Shelhamer, Trevor Darrell, Fully convolutional networks for semantic segmentation computer vision and pattern recognition. pp. 3431- 3440 ,(2015) , 10.1109/CVPR.2015.7298965
Ziyi Chen, Cheng Wang, Chenglu Wen, Xiuhua Teng, Yiping Chen, Haiyan Guan, Huan Luo, Liujuan Cao, Jonathan Li, Vehicle Detection in High-Resolution Aerial Images via Sparse Representation and Superpixels IEEE Transactions on Geoscience and Remote Sensing. ,vol. 54, pp. 103- 116 ,(2016) , 10.1109/TGRS.2015.2451002
Bharath Hariharan, Pablo Arbelaez, Ross Girshick, Jitendra Malik, Hypercolumns for object segmentation and fine-grained localization computer vision and pattern recognition. pp. 447- 456 ,(2015) , 10.1109/CVPR.2015.7298642
Thomas Moranduzzo, Farid Melgani, A SIFT-SVM method for detecting cars in UAV images international geoscience and remote sensing symposium. pp. 6868- 6871 ,(2012) , 10.1109/IGARSS.2012.6352585
Hsu-Yung Cheng, Chih-Chia Weng, Yi-Ying Chen, Vehicle Detection in Aerial Surveillance Using Dynamic Bayesian Networks IEEE Transactions on Image Processing. ,vol. 21, pp. 2152- 2159 ,(2012) , 10.1109/TIP.2011.2172798
Xiaoqiang Lu, Xuelong Li, Lichao Mou, Semi-Supervised Multitask Learning for Scene Recognition IEEE Transactions on Systems, Man, and Cybernetics. ,vol. 45, pp. 1967- 1976 ,(2015) , 10.1109/TCYB.2014.2362959