MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

作者: Marco Andreetto , Tobias Weyand , Hartwig Adam , Menglong Zhu , Dmitry Kalenichenko

DOI:

关键词:

摘要: We present a class of efficient models called MobileNets for mobile and embedded vision applications. MobileNets are based on a streamlined architecture that uses depth-wise separable convolutions to build light weight deep neural networks. We introduce two simple global hyper-parameters that efficiently trade off between latency and accuracy. These hyper-parameters allow the model builder to choose the right sized model for their application based on the constraints of the problem. We present extensive experiments on resource and …

参考文章(32)
Eugenio Culurciello, Jonghoon Jin, Aysegul Dundar, Flattened Convolutional Neural Networks for Feedforward Acceleration arXiv: Neural and Evolutionary Computing. ,(2014)
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
Geoffrey Hinton, Oriol Vinyals, Jeff Dean, Distilling the Knowledge in a Neural Network arXiv: Machine Learning. ,(2015)
James Hays, Alexei A. Efros, Large-Scale Image Geolocalization Multimodal Location Estimation of Videos and Images. pp. 41- 62 ,(2015) , 10.1007/978-3-319-09861-6_3
Florian Schroff, Dmitry Kalenichenko, James Philbin, FaceNet: A unified embedding for face recognition and clustering computer vision and pattern recognition. pp. 815- 823 ,(2015) , 10.1109/CVPR.2015.7298682
James Hays, Alexei A. Efros, IM2GPS: estimating geographic information from a single image computer vision and pattern recognition. pp. 1- 8 ,(2008) , 10.1109/CVPR.2008.4587784
Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, Li Fei-Fei, ImageNet Large Scale Visual Recognition Challenge International Journal of Computer Vision. ,vol. 115, pp. 211- 252 ,(2015) , 10.1007/S11263-015-0816-Y
William J. Dally, William J. Dally, Song Han, Huizi Mao, Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding arXiv: Computer Vision and Pattern Recognition. ,(2015)
Ivan Oseledets, Victor Lempitsky, Yaroslav Ganin, Vadim Lebedev, Vadim Lebedev, Maksim Rakhuba, Maksim Rakhuba, Speeding-up Convolutional Neural Networks Using Fine-tuned CP-Decomposition arXiv: Computer Vision and Pattern Recognition. ,(2014)
Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg, SSD: Single Shot MultiBox Detector arXiv: Computer Vision and Pattern Recognition. ,(2015) , 10.1007/978-3-319-46448-0_2