Similarity learning in nearest neighbor and application to information retrieval

作者: Ali Mustafa Qamar , Eric Gaussier

DOI: 10.14236/EWIC/FDIA2009.22

关键词:

摘要: Many people have tried to learn Mahanalobis distance metric in kNN classification by considering the geometry of space containing examples. However, similarity may an edge specially while dealing with text e.g. Information Retrieval. We proposed online algorithm, SiLA (Similarity learning algorithm) where aim is a (e.g. cosine measure, Dice and Jaccard coefficients) its variation eSiLA we project matrix learnt onto cone positive, semidefinite matrices. Two incremental algorithms been developed; one based on standard rule other symmetric version. can be used Retrievalwhere performance improved using user feedback.

参考文章(5)
Yoav Freund, Robert E. Schapire, Large margin classification using the perceptron algorithm conference on learning theory. ,vol. 37, pp. 209- 217 ,(1998) , 10.1145/279943.279985
Shai Shalev-Shwartz, Yoram Singer, Andrew Y. Ng, Online and batch learning of pseudo-metrics Twenty-first international conference on Machine learning - ICML '04. pp. 94- ,(2004) , 10.1145/1015330.1015376
Ali Mustafa Qamar, Eric Gaussier, Jean-Pierre Chevallet, Joo Hwee Lim, Similarity Learning for Nearest Neighbor Classification international conference on data mining. pp. 983- 988 ,(2008) , 10.1109/ICDM.2008.81
Kevin Bache, Moshe Lichman, UCI Machine Learning Repository University of California, School of Information and Computer Science. ,(2007)