作者: Olena Medelyan , David Milne , Catherine Legg , Ian H. Witten
DOI: 10.1016/J.IJHCS.2009.05.004
关键词:
摘要: Wikipedia is a goldmine of information; not just for its many readers, but also the growing community researchers who recognize it as resource exceptional scale and utility. It represents vast investment manual effort judgment: huge, constantly evolving tapestry concepts relations that being applied to host tasks. This article provides comprehensive description this work. focuses on research extracts makes use concepts, relations, facts descriptions found in Wikipedia, organizes work into four broad categories: applying natural language processing; using facilitate information retrieval extraction; ontology building. The addresses how used is, improved adapted, combined with other structures create entirely new resources. We identify groups individuals involved, their has developed last few years. provide list open-source software they have produced.