Working hard to know your neighbor's margins:Local descriptor learning loss

作者: Filip Radenovic , Dmytro Mishkin , Jiri Matas , Anastasiya Mishchuk

DOI:

关键词:

摘要: We introduce a novel loss for learning local feature descriptors which is inspired by the Lowe's matching criterion SIFT. show that proposed maximizes distance between closest positive and negative patch in batch better than complex regularization methods; it works well both shallow deep convolution network architectures. Applying to L2Net CNN architecture results compact descriptor -- has same dimensionality as SIFT (128) shows state-of-art performance wide baseline stereo, verification instance retrieval benchmarks. It fast, computing takes about 1 millisecond on low-end GPU.

参考文章(25)
David G. Lowe, Marius Muja, FAST APPROXIMATE NEAREST NEIGHBORS WITH AUTOMATIC ALGORITHM CONFIGURATION international conference on computer vision theory and applications. pp. 331- 340 ,(2009)
Karel Lenc, Michal Perdoch, Dmytro Mishkin, Jiri Matas, WxBS: Wide Baseline Stereo Generalizations arXiv: Computer Vision and Pattern Recognition. ,(2015)
Karen Simonyan, Andrea Vedaldi, Andrew Zisserman, Descriptor Learning Using Convex Optimisation Computer Vision – ECCV 2012. pp. 243- 256 ,(2012) , 10.1007/978-3-642-33718-5_18
Jingming Dong, Stefano Soatto, Domain-size pooling in local descriptors: DSP-SIFT computer vision and pattern recognition. pp. 5097- 5106 ,(2015) , 10.1109/CVPR.2015.7299145
Xufeng Han, Thomas Leung, Yangqing Jia, Rahul Sukthankar, Alexander C. Berg, MatchNet: Unifying feature and metric learning for patch-based matching computer vision and pattern recognition. pp. 3279- 3286 ,(2015) , 10.1109/CVPR.2015.7298948
Michal Perd'och, Ondrej Chum, Jiri Matas, Efficient representation of local geometry for large scale object retrieval computer vision and pattern recognition. pp. 9- 16 ,(2009) , 10.1109/CVPR.2009.5206529
R. Arandjelovic, A. Zisserman, Three things everyone should know to improve object retrieval computer vision and pattern recognition. pp. 2911- 2918 ,(2012) , 10.1109/CVPR.2012.6248018
Hervé Jégou, Matthijs Douze, Cordelia Schmid, Improving Bag-of-Features for Large Scale Image Search International Journal of Computer Vision. ,vol. 87, pp. 316- 336 ,(2010) , 10.1007/S11263-009-0285-2
D. C. Hauagge, N. Snavely, Image matching using local symmetry features computer vision and pattern recognition. pp. 206- 213 ,(2012) , 10.1109/CVPR.2012.6247677
Andrew M. Saxe, James L. McClelland, Surya Ganguli, Exact solutions to the nonlinear dynamics of learning in deep linear neural networks arXiv: Neural and Evolutionary Computing. ,(2013)