Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

作者: Shaoqing Ren , Kaiming He , Jian Sun , Xiangyu Zhang

DOI: 10.1007/978-3-319-10578-9_23

关键词:

摘要: … with another pooling strategy, “spatial pyramid pooling”, to … while still preserving the spatial pyramid pooling behaviors. … the bin sizes needed for spatial pyramid pooling. Consider the …

参考文章(41)
C. Lawrence Zitnick, Piotr Dollár, Edge Boxes: Locating Object Proposals from Edges Computer Vision – ECCV 2014. pp. 391- 405 ,(2014) , 10.1007/978-3-319-10602-1_26
Yunchao Gong, Liwei Wang, Ruiqi Guo, Svetlana Lazebnik, Multi-scale Orderless Pooling of Deep Convolutional Activation Features european conference on computer vision. pp. 392- 407 ,(2014) , 10.1007/978-3-319-10584-0_26
Florent Perronnin, Jorge Sánchez, Thomas Mensink, Improving the fisher kernel for large-scale image classification european conference on computer vision. ,vol. 6314, pp. 143- 156 ,(2010) , 10.1007/978-3-642-15561-1_11
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
Shuicheng Yan, Qiang Chen, Min Lin, Network In Network arXiv: Neural and Evolutionary Computing. ,(2013)
Yann LeCun, Mikael Henaff, Michael Mathieu, Fast Training of Convolutional Networks through FFTs arXiv: Computer Vision and Pattern Recognition. ,(2013)
Ken Chatfield, Victor Lempitsky, Andrea Vedaldi, Andrew Zisserman, The devil is in the details: an evaluation of recent feature encoding methods british machine vision conference. pp. 1- 12 ,(2011) , 10.5244/C.25.76
Herve Jegou, Florent Perronnin, Matthijs Douze, Jorge Sánchez, Patrick Perez, Cordelia Schmid, Aggregating Local Image Descriptors into Compact Codes IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 34, pp. 1704- 1716 ,(2012) , 10.1109/TPAMI.2011.235
Andrea Vedaldi, Karen Simonyan, Ken Chatfield, Andrew Zisserman, Return of the Devil in the Details: Delving Deep into Convolutional Nets arXiv: Computer Vision and Pattern Recognition. ,(2014)
Ming-Ming Cheng, Ziming Zhang, Wen-Yan Lin, Philip Torr, BING: Binarized Normed Gradients for Objectness Estimation at 300fps computer vision and pattern recognition. ,vol. 5, pp. 3286- 3293 ,(2014) , 10.1109/CVPR.2014.414