Using Wikipedia categories for compact representations of chemical documents

作者: Benjamin Köhncke , Wolf-Tilo Balke

DOI: 10.1145/1871437.1871735

关键词:

摘要: Today, Web pages are usually accessed using text search engines, whereas documents stored in the deep through domain-specific portals. These portals rely on external knowledge bases, respectively ontologies, mapping to more general concepts allowing for suitable classifications and navigational browsing. Since automatically generated ontologies still not satisfactory advanced information retrieval tasks, most heavily hand-crafted ontologies. This, however, also leads high creation maintaining costs. On other hand, a freely available community maintained, if somewhat general, base is offered by Wikipedia. During last years coverage of Wikipedia has reached large pool including articles from almost all domains. In this paper, we investigate use categories describe content chemical compact form. We compare results ChEBI ontology show that indeed allow useful descriptions even better than ontology.

参考文章(11)
Claire Nédellec, Gilles Bisson, Dolores Cañamero, Designing clustering methods for ontology building: the Mo'K workbench OL'00 Proceedings of the First International Conference on Ontology Learning - Volume 31. pp. 13- 28 ,(2000)
Sascha Tönnies, Benjamin Köhncke, Wolf-Tilo Balke, Oliver Köpler, Building Chemical Information Systems - the ViFaChem II Project. BTW. pp. 247- 256 ,(2009)
Peter Corbett, Peter Murray-Rust, High-Throughput Identification of Chemistry in Life Science Texts Computational Life Sciences II. pp. 107- 118 ,(2006) , 10.1007/11875741_11
Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, Xiaohua Zhou, Exploiting Wikipedia as external knowledge for document clustering Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '09. pp. 389- 396 ,(2009) , 10.1145/1557019.1557066
Peter Schonhofen, Identifying Document Topics Using the Wikipedia Category Network web intelligence. pp. 456- 462 ,(2006) , 10.1109/WI.2006.92
Sascha Tönnies, Benjamin Köhncke, Oliver Koepler, Wolf-Tilo Balke, Exposing the hidden web for chemical digital libraries acm/ieee joint conference on digital libraries. pp. 235- 244 ,(2010) , 10.1145/1816123.1816159
K. Degtyarenko, P. de Matos, M. Ennis, J. Hastings, M. Zbinden, A. McNaught, R. Alcantara, M. Darsow, M. Guedj, M. Ashburner, ChEBI: a database and ontology for chemical entities of biological interest Nucleic Acids Research. ,vol. 36, pp. 344- 350 ,(2007) , 10.1093/NAR/GKM791
JooYoung Choi, Melissa J Davis, Andrew F Newman, Mark A Ragan, None, A semantic web ontology for small molecules and their biological targets. Journal of Chemical Information and Modeling. ,vol. 50, pp. 732- 741 ,(2010) , 10.1021/CI900461J
Jonathan Yu, James A. Thom, Audrey Tam, Ontology evaluation using wikipedia categories for browsing conference on information and knowledge management. pp. 223- 232 ,(2007) , 10.1145/1321440.1321474