The neural-SIFT feature descriptor for visual vocabulary object recognition

作者: Sybren Jansen , Amirhosein Shantia , Marco A. Wiering

DOI: 10.1109/IJCNN.2015.7280660

关键词:

摘要: Recognizing the semantic content of an image is a challenging problem in computer vision. Many researchers attempt to apply local descriptors extract features from image, but choosing best type feature use still open problem. Some these systems are only trained once using fixed descriptor, like Scale Invariant Feature Transform (SIFT). In most cases algorithms show good performance, they do not learn their mistakes training completed. this paper continuous deep neural network feedback system proposed which consists adaptive bag visual words approach and classifier. Two initialization methods for descriptor were compared, one where it was on SIFT output randomly initialized. After initial training, propagates classification error classifier through entire pipeline, updating itself, also extract. Results that both increased accuracy substantially when regular able increase any further. The neural-SIFT performs better than itself even with limited number instances. Initializing existing beneficial lot samples available. However, there construct well-performing solely based feedback.

参考文章(48)
Michael Husken, Christian Igel, Improving the Rprop Learning Algorithm ,(2000)
Yoshua Bengio, Yoshua Bengio, Yoshua Bengio, Yann LeCun, Convolutional networks for images, speech, and time series The handbook of brain theory and neural networks. pp. 255- 258 ,(1998)
Florent Monay, Pedro Quelhas, Daniel Gatica-Perez, Jean-Marc Odobez, Constructing Visual Models with a Latent Space Approach Subspace, Latent Structure and Feature Selection. pp. 115- 126 ,(2006) , 10.1007/11752790_7
Florent Perronnin, Christopher Dance, Gabriela Csurka, Marco Bressan, Adapted Vocabularies for Generic Visual Categorization Computer Vision – ECCV 2006. pp. 464- 475 ,(2006) , 10.1007/11744085_36
G. Csurka, Visual categorization with bags of keypoints european conference on computer vision. ,vol. 1, pp. 22- ,(2004)
David E. Rumelhart, James L. McClelland, , Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations Computational Models of Cognition and Perception. ,(1986) , 10.7551/MITPRESS/5236.001.0001
Herbert Bay, Tinne Tuytelaars, Luc Van Gool, SURF: speeded up robust features european conference on computer vision. ,vol. 1, pp. 404- 417 ,(2006) , 10.1007/11744023_32
David D. Lewis, Naive (Bayes) at forty: The independence assumption in information retrieval Machine Learning: ECML-98. pp. 4- 15 ,(1998) , 10.1007/BFB0026666
Du-Ming Tsai, Boundary-based corner detection using neural networks Pattern Recognition. ,vol. 30, pp. 85- 97 ,(1997) , 10.1016/S0031-3203(96)00057-X