Mining web query hierarchies from clickthrough data

作者: Weizhu Chen , Qiang Yang , Dou Shen , Zheng Chen , Min Qin

DOI:

关键词: Web query classificationQuery expansionRelation (database)Web search querySpatial queryInformation retrievalWeb pageComputer scienceSearch engine

摘要: In this paper, we propose to mine query hierarchies from clickthrough data, which is within the larger area of automatic acquisition knowledge Web. When a user submits search engine and clicks on returned Web pages, user's understanding as well its relation pages encoded in data. With millions queries being submitted engines every day, it both important beneficial hidden their intended pages. We can use information various ways, such providing suggestions organizing queries. plan exploit logs by constructing hierarchies, reflect relationship among Our proposed method consists two stages: generating candidate determining "generalization/specialization" relatinns between these hierarchy. test our some labeled data sets illustrate effectiveness solution empirically.

参考文章(20)
Marius Pasca, Benjamin Van Durme, What you seek is what you get: extraction of class attributes from query logs international joint conference on artificial intelligence. pp. 2832- 2837 ,(2007)
P. Buitelaar, P. Cimiano, B. Magnini, Ontology Learning from Text: Methods, Evaluation and Applications ,(2005)
Paul Buitelaar, Daniel Olejnik, Michael Sintek, A protégé plug-in for ontology extraction from text based on linguistic analysis Lecture Notes in Computer Science. pp. 31- 44 ,(2004) , 10.1007/978-3-540-25956-5_3
Zellig Sabbettai Harris, Mathematical structures of language ,(1968)
Hinrich Schütze, Christopher D. Manning, Prabhakar Raghavan, Introduction to Information Retrieval ,(2005)
Stephan Bloehdorn, Philipp Cimiano, Andreas Hotho, Learning Ontologies to Improve Text Clustering and Classification GfKl. pp. 334- 341 ,(2006) , 10.1007/3-540-31314-1_40
Hermine Njike Fotzo, Patrick Gallinari, Learning « generalization/specialization » relations between concepts: application for automatically building thematic document hierarchies RIAO '04 Coupling approaches, coupling media and coupling languages for information retrieval. pp. 143- 155 ,(2004)
S. Bloehdorn, A. Hotho, Text classification by boosting weak learners based on terms and concepts international conference on data mining. pp. 331- 334 ,(2004) , 10.1109/ICDM.2004.10077
Doug Beeferman, Adam Berger, Agglomerative clustering of a search engine query log knowledge discovery and data mining. pp. 407- 416 ,(2000) , 10.1145/347090.347176
Luis Gravano, Vasileios Hatzivassiloglou, Richard Lichtenstein, Categorizing web queries according to geographical locality Proceedings of the twelfth international conference on Information and knowledge management - CIKM '03. pp. 325- 333 ,(2003) , 10.1145/956863.956925