Mining meaning from Wikipedia

作者: Olena Medelyan , David Milne , Catherine Legg , Ian H. Witten

DOI: 10.1016/J.IJHCS.2009.05.004

关键词:

摘要: Wikipedia is a goldmine of information; not just for its many readers, but also the growing community researchers who recognize it as resource exceptional scale and utility. It represents vast investment manual effort judgment: huge, constantly evolving tapestry concepts relations that being applied to host tasks. This article provides comprehensive description this work. focuses on research extracts makes use concepts, relations, facts descriptions found in Wikipedia, organizes work into four broad categories: applying natural language processing; using facilitate information retrieval extraction; ontology building. The addresses how used is, improved adapted, combined with other structures create entirely new resources. We identify groups individuals involved, their has developed last few years. provide list open-source software they have produced.

参考文章(183)
Zsolt Minier, Zalan Bodo, Lehel Csato, Wikipedia-Based Kernels for Text Categorization symbolic and numeric algorithms for scientific computing. pp. 157- 164 ,(2007) , 10.1109/SYNASC.2007.8
Evgeniy Gabrilovich, Shaul Markovitch, Overcoming the brittleness bottleneck using wikipedia: enhancing text categorization with encyclopedic knowledge national conference on artificial intelligence. pp. 1301- 1306 ,(2006)
David Milne, Ian H. Witten, Learning to link with wikipedia Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08. pp. 509- 518 ,(2008) , 10.1145/1458082.1458150
Gjergji Kasneci, Fabian M. Suchanek, Georgiana Ifrim, Maya Ramanath, Gerhard Weikum, NAGA: Searching and Ranking Knowledge international conference on data engineering. pp. 953- 962 ,(2008) , 10.1109/ICDE.2008.4497504
Jay J Jiang, David W Conrath, None, Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy Proceedings of the 10th Research on Computational Linguistics International Conference. pp. 19- 33 ,(1997)
Maria Ruiz-Casado, Enrique Alfonseca, Pablo Castells, Automatising the learning of lexical patterns: An application to the enrichment of WordNet by extracting semantic relationships from Wikipedia data and knowledge engineering. ,vol. 61, pp. 484- 499 ,(2007) , 10.1016/J.DATAK.2006.06.011
George A. Miller, Walter G. Charles, Contextual correlates of semantic similarity Language and Cognitive Processes. ,vol. 6, pp. 1- 28 ,(1991) , 10.1080/01690969108406936
S. P. Ponzetto, M. Strube, Knowledge derived from wikipedia for computing semantic relatedness Journal of Artificial Intelligence Research. ,vol. 30, pp. 181- 212 ,(2007) , 10.1613/JAIR.2308
Simone Paolo Ponzetto, Michael Strube, An API for Measuring the Relatedness of Words in Wikipedia meeting of the association for computational linguistics. pp. 49- 52 ,(2007) , 10.3115/1557769.1557785
K.S. Schlobach, G.A. Mishne, V. Jijkoun, D.D. Ahn, K.E. Müller, M. de Rijke, Using Wikipedia at the TREC QA Track text retrieval conference. ,(2005)