A compression pipeline for one-stage object detection model

作者: Hongye Su , Lei Xie , Yong Liu , Yiran Sun , Guanzhong Tian

DOI: 10.1007/S11554-020-01053-Z

关键词: Pipeline (computing)Object detectionComputer scienceQuantization (image processing)Pruning (morphology)Edge deviceInteger (computer science)Floating pointAlgorithmContextual image classification

摘要: Deep neural networks (DNNs) have strong fitting ability on a variety of computer vision tasks, but they also require intensive computing power and large storage space, which are not always available in portable smart devices. Although lot studies contributed to the compression image classification networks, there few model algorithms for object detection models. In this paper, we propose general pipeline one-stage meet real-time requirements. Firstly, softer pruning strategy backbone reduce number filters. Compared with original direct pruning, our method can maintain integrity network structure drop accuracy. Secondly, transfer knowledge small by distillation accuracy caused pruning. Finally, as edge devices more suitable integer operations, further transform 32-bit floating point into 8-bit through quantization. With pipeline, size inference time compressed 10% or less original, while mAP is only reduced 2.5% less. We verified that performance Pascal VOC dataset.

参考文章(27)
Vincent Vanhoucke, Andrew Senior, Mark Z. Mao, Improving the speed of neural networks on CPUs hgpu.org. ,(2011)
Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 39, pp. 1137- 1149 ,(2017) , 10.1109/TPAMI.2016.2577031
Ross Girshick, Fast R-CNN international conference on computer vision. pp. 1440- 1448 ,(2015) , 10.1109/ICCV.2015.169
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
William J. Dally, John Tran, Song Han, Jeff Pool, Learning both Weights and Connections for Efficient Neural Networks arXiv: Neural and Evolutionary Computing. ,(2015)
S. Kullback, R. A. Leibler, On Information and Sufficiency Annals of Mathematical Statistics. ,vol. 22, pp. 79- 86 ,(1951) , 10.1214/AOMS/1177729694
Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik, Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation computer vision and pattern recognition. pp. 580- 587 ,(2014) , 10.1109/CVPR.2014.81
Yu-Wing Tai, Chi-Keung Tang, Rui Peng, Hengyuan Hu, Network Trimming: A Data-Driven Neuron Pruning Approach towards Efficient Deep Architectures arXiv: Neural and Evolutionary Computing. ,(2016)
Joseph Redmon, Ali Farhadi, YOLO9000: Better, Faster, Stronger 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 6517- 6525 ,(2017) , 10.1109/CVPR.2017.690
Ying Zhang, Tao Xiang, Timothy M Hospedales, Huchuan Lu, None, Deep Mutual Learning computer vision and pattern recognition. pp. 4320- 4328 ,(2018) , 10.1109/CVPR.2018.00454