Highlighting relevant concepts from Topic Signatures

作者: German Rigau , Llu'is Padr'o , Montse Cuadros

DOI:

关键词: Computer scienceInformation retrievalTask (project management)WordNetSignature (logic)Artificial intelligenceNatural language processingResource (project management)Similarity (psychology)Word (computer architecture)Word-sense disambiguationFilter (higher-order function)

摘要: This paper presents deepKnowNet, a new fully automatic method for building highly dense and accurate knowledge bases from existing semantic resources. Basically, the applies knowledge-based Word Sense Disambiguation algorithm to assign most appropriate WordNet sense large sets of topically related words acquired web, named TSWEB. is personalized PageRank implemented in UKB. improves by means current content creating volumes semantic relations between synsets. KnowNet was our first attempt towards acquisition relations. However, had some limitations that have been overcomed with deepKnowNet. deepKnowNet disambiguates hundred all Topic Signatures web (TSWEB). In this case, highlights relevant word senses each Signature filter out ones are not so topic. fact, the it contains outperforms any other resource when empirically evaluated common framework based on similarity task annotated human judgements

参考文章(23)
Bernardo Magnini, Gabriela Cavaglia, Integrating Subject Field Codes into WordNet language resources and evaluation. ,(2000)
Eneko Agirre, Oier Lopez de Lacalle, Publicly Available Topic Signatures for all WordNet Nominal Senses language resources and evaluation. ,(2004)
Eneko Agirre, Eduard H. Hovy, David Martínez, Olatz Ansa, Enriching WordNet concepts with topic signatures arXiv: Computation and Language. ,(2001)
Jordi Atserias, German Rigau, Egoitz Laparra, Javier Álvez, Antoni Oliver, Salvador Climent, Jordi Carrera, Complete and Consistent Annotation of WordNet using the Top Concept Ontology language resources and evaluation. ,(2008)
Martin Chodorow, George A. Miller, Claudia Leacock, Using corpus statistics and WordNet relations for sense identification Computational Linguistics. ,vol. 24, pp. 147- 165 ,(1998) , 10.5555/972719.972726
Rada Mihalcea, Dan I. Moldovan, eXtended WordNet: progress report north american chapter of the association for computational linguistics. ,(2001)
Eneko Agirre, David Martínez, Integrating selectional preferences in WordNet arXiv: Computation and Language. ,(2002)
Montse Cuadros, German Rigau, Quality Assessment of Large Scale Knowledge Resources empirical methods in natural language processing. pp. 534- 541 ,(2006) , 10.3115/1610075.1610149
Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum, Yago: a core of semantic knowledge the web conference. pp. 697- 706 ,(2007) , 10.1145/1242572.1242667