Speeding up Convolutional Neural Networks with Low Rank Expansions

作者: Max Jaderberg , Andrea Vedaldi , Andrew Zisserman

DOI: 10.5244/C.28.88

关键词:

摘要: The focus of this paper is speeding up the application convolutional neural networks. While delivering impressive results across a range computer vision and machine learning tasks, these networks are computationally demanding, limiting their deployability. Convolutional layers generally consume bulk processing time, so in work we present two simple schemes for drastically layers. This achieved by exploiting cross-channel or filter redundancy to construct low rank basis filters that rank-1 spatial domain. Our methods architecture agnostic, can be easily applied existing CPU GPU frameworks tuneable speedup performance. We demonstrate with real world network designed scene text character recognition [15], showing possible 2.5× no loss accuracy, 4.5× less than 1% drop still achieving state-of-the-art on standard benchmarks.

参考文章(38)
Max Jaderberg, Andrea Vedaldi, Andrew Zisserman, Deep Features for Text Spotting european conference on computer vision. pp. 512- 528 ,(2014) , 10.1007/978-3-319-10593-2_34
Franck Mamalet, Christophe Garcia, Simplifying ConvNets for Fast Learning Artificial Neural Networks and Machine Learning – ICANN 2012. pp. 58- 65 ,(2012) , 10.1007/978-3-642-33266-1_8
Vincent Vanhoucke, Andrew Senior, Mark Z. Mao, Improving the speed of neural networks on CPUs hgpu.org. ,(2011)
Hyun Oh Song, Trevor Darrell, Ross Girshick, Discriminatively Activated Sparselets international conference on machine learning. pp. 196- 204 ,(2013)
Lukas Neumann, Jiri Matas, A method for text localization and recognition in real-world images asian conference on computer vision. pp. 770- 783 ,(2010) , 10.1007/978-3-642-19318-7_60
Andrea Vedaldi, Max Jaderberg, Karen Simonyan, Andrew Zisserman, Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition arXiv: Computer Vision and Pattern Recognition. ,(2014)
Sergey Karayev, Forrest N. Iandola, Kurt Keutzer, Trevor Darrell, Matthew W. Moskewicz, Ross B. Girshick, DenseNet: Implementing Efficient ConvNet Descriptor Pyramids arXiv: Computer Vision and Pattern Recognition. ,(2014)
Manik Varma, Teófilo Emídio de Campos, Bodla Rakesh Babu, CHARACTER RECOGNITION IN NATURAL IMAGES international conference on computer vision theory and applications. pp. 273- 280 ,(2009)
Andrew Y. Ng, Adam Coates, David J. Wu, Tao Wang, End-to-end text recognition with convolutional neural networks international conference on pattern recognition. pp. 3304- 3308 ,(2012)
Clément Farabet, Yann Lecun, Koray Kavukcuoglu, Berin Martini, Polina Akselrod, Selcuk Talay, Eugenio Culurciello, Large-Scale FPGA-based Convolutional Networks Cambridge University Press. pp. 399- 419 ,(2011) , 10.1017/CBO9781139042918.020