Fast k most similar neighbor classifier for mixed data based on approximating and eliminating

作者: Selene Hernández-Rodríguez , J. Ariel Carrasco-Ochoa , J. Fco. Martínez-Trinidad

DOI: 10.1007/978-3-540-68125-0_66

关键词:

摘要: The k nearest neighbor (k-NN) classifier has been a widely used nonparametric technique in Pattern Recognition. In order to decide the class of new prototype, k-NN performs an exhaustive comparison between prototype classify (query) and prototypes training set T. However, when T is large, expensive. To avoid this problem, many fast algorithms have developed. Some these are based on Approximating-Eliminating search. case, Approximating Eliminating steps rely triangle inequality. soft sciences, usually described by qualitative quantitative features (mixed data), sometimes function does not satisfy Therefore, work, most similar neighbour for mixed data (AEMD) presented. This consists two phases. first phase, binary similarity matrix among stored. second steps, which inequality, proposed compared against other algorithms, adapted work with data. experiments real datasets presented

参考文章(13)
Karina Figueroa, Edgar Chávez, Gonzalo Navarro, Rodrigo Paredes, On the Least Cost for Proximity Searching in Metric Spaces Experimental Algorithms. pp. 279- 290 ,(2006) , 10.1007/11764298_26
Javier Raymundo García-Serrano, José Francisco Martínez-Trinidad, Extension to C-means Algorithm for the Use of Similarity Functions european conference on principles of data mining and knowledge discovery. pp. 354- 359 ,(1999) , 10.1007/978-3-540-48247-5_42
Sumio Masuda, Ken Tokoro, Kazuaki Yamaguchi, Improvements of TLAESA nearest neighbour search algorithm and extension to approximation search ACSC '06 Proceedings of the 29th Australasian Computer Science Conference - Volume 48. pp. 77- 83 ,(2006)
Yong-Sheng Chen, Yi-Ping Hung, Ting-Fang Yen, Chiou-Shann Fuh, Fast and versatile algorithm for nearest neighbor search based on a lower bound tree Pattern Recognition. ,vol. 40, pp. 360- 375 ,(2007) , 10.1016/J.PATCOG.2005.08.016
Luisa Micó, Jose Oncina, Rafael C. Carrasco, A fast branch & bound nearest neighbour classifier in metric spaces Pattern Recognition Letters. ,vol. 17, pp. 731- 739 ,(1996) , 10.1016/0167-8655(96)00032-3
C. L. Blake, UCI Repository of machine learning databases www.ics.uci.edu/〜mlearn/MLRepository.html. ,(1998)
K. Fukunaga, P.M. Narendra, A Branch and Bound Algorithm for Computing k-Nearest Neighbors IEEE Transactions on Computers. ,vol. 24, pp. 750- 753 ,(1975) , 10.1109/T-C.1975.224297
Enrique Vidal Ruiz, An algorithm for finding nearest neighbours in (approximately) constant average time Pattern Recognition Letters. ,vol. 4, pp. 145- 157 ,(1986) , 10.1016/0167-8655(86)90013-9
T. Cover, P. Hart, Nearest neighbor pattern classification IEEE Transactions on Information Theory. ,vol. 13, pp. 21- 27 ,(1967) , 10.1109/TIT.1967.1053964