Authors: Jordi Atserias, Giuseppe Attardi, Hugo Zaragoza, Massimiliano Ciaramita
DOI:
Keywords: Computer science, Explicit semantic analysis, Snapshot (computer storage), Documentation, Information retrieval, World Wide Web
Abstract: This paper describes SW1, the first version of a semantically annotated snapshot of the English Wikipedia. In recent years, Wikipedia has become a valuable resource for both the Natural Language Processing (NLP) community and the Information Retrieval (IR) community. Although NLP technology for processing Wikipedia already exists, not all researchers and developers have the computational resources to process such a volume of information. Moreover, the use of different versions of Wikipedia, processed differently, might make it difficult to compare results. The aim of this work is to provide easy access to syntactic and semantic annotations for researchers in both the NLP and IR communities by building a reference corpus that homogenizes experiments and makes results comparable. These resources, including an entity containment derived graph, are licensed under the GNU Free Documentation License and are available from http://www.yr-bcn.es/semanticWikipedia