Efficient query routing for information retrieval in semantic overlays

作者: Hai Jin , Xiaomin Ning , Hanhua Chen , Zuoning Yin

DOI: 10.1145/1141277.1141672

关键词:

摘要: A fundamental problem in peer-to-peer networks is how to locate appropriate peers efficiently answer a specific query request. This paper proposes model which semantically similar form semantic overlay network and can be routed or forwarded instead of broadcasting random selection. We apply Latent Semantic Indexing (LSI) information retrieval reveal subspaces feature spaces from documents stored on peers. After producing vectors through LSI, we train support vector machine (SVM) classify the into different categories based extracted vectors. Peers with close are defined as similarity overlay. Experimental results show efficient performs better than other non-semantic models respect accuracy. In addition, our approach improves recall rate nearly 100% while reducing message traffic dramatically compared Gnutella.

参考文章(14)
dave beckett, World Wide Web Conference 2004 Ariadne. ,(2004)
Tamara G. Kolda, Limited-memory matrix methods with applications University of Maryland at College Park. ,(1997)
Michael W. Berry, Zlatko Drmac, Elizabeth R. Jessup, Matrices, Vector Spaces, and Information Retrieval SIAM Review. ,vol. 41, pp. 335- 362 ,(1999) , 10.1137/S0036144598347035
David Hull, Improving text retrieval for the routing problem using latent semantic indexing international acm sigir conference on research and development in information retrieval. pp. 282- 291 ,(1994) , 10.5555/188490.188585
Bernhard E. Boser, Isabelle M. Guyon, Vladimir N. Vapnik, A training algorithm for optimal margin classifiers conference on learning theory. pp. 144- 152 ,(1992) , 10.1145/130385.130401
Chunqiang Tang, Zhichen Xu, Sandhya Dwarkadas, Peer-to-peer information retrieval using self-organizing semantic overlay networks Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications - SIGCOMM '03. pp. 175- 186 ,(2003) , 10.1145/863955.863976
Chunqiang Tang, Sandhya Dwarkadas, Zhichen Xu, On scaling latent semantic indexing for large peer-to-peer systems Proceedings of the 27th annual international conference on Research and development in information retrieval - SIGIR '04. pp. 112- 121 ,(2004) , 10.1145/1008992.1009014
U.M. Feyyad, Data mining and knowledge discovery: making sense out of data IEEE Intelligent Systems. ,vol. 11, pp. 20- 25 ,(1996) , 10.1109/64.539013
Christopher J.C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition Data Mining and Knowledge Discovery. ,vol. 2, pp. 121- 167 ,(1998) , 10.1023/A:1009715923555
Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, Richard Harshman, Indexing by Latent Semantic Analysis Journal of the Association for Information Science and Technology. ,vol. 41, pp. 391- 407 ,(1990) , 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9