Benchmarking Ontologies: Bigger or Better?

作者: Lixia Yao , Anna Divoli , Ilya Mayzus , James A. Evans , Andrey Rzhetsky

DOI: 10.1371/JOURNAL.PCBI.1001055

关键词:

摘要: A scientific ontology is a formal representation of knowledge within domain, typically including central concepts, their properties, and relations. With the rise computers high-throughput data collection, ontologies have become essential to mining sharing across communities in biomedical sciences. Powerful approaches exist for testing internal consistency an ontology, but not assessing fidelity its domain representation. We introduce family metrics that describe breadth depth with which represents domain. then test these using (1) four most common medical respect corpus documents (2) seven popular English thesauri three corpora sample language from medicine, news, novels. Here we show our approach captures quality ontological guides efforts narrow breach between collective discourse Our results also demonstrate key features ontologies, thesauri, different domains. Medical small intersection, as do thesauri. Moreover, dialects characteristic distinct domains vary strikingly many same words are used quite differently As intended mirror state knowledge, methods tighten fit will increase relevance new areas science improve accuracy power inferences computed them.

参考文章(75)
Klemens Schnattinger, Udo Hahn, Towards text knowledge engineering national conference on artificial intelligence. pp. 524- 531 ,(1998)
Marko Grobelnik, Janez Brank, Dunja Mladenic, Golden Standard Based Ontology Evaluation Using Instance Assignment. EON@WWW. ,(2006)
Jose L. V. Mejino, Cornelius Rosse, Landon T. Detwiler, Daniel L. Cook, Anand Kumar, Barry Smith, A Strategy for Improving and Integrating Biomedical Ontologies american medical informatics association annual symposium. ,vol. 2005, pp. 639- 643 ,(2005)
Svetlana Kiritchenko, Fazel Famili, Stan Matwin, Functional Annotation of Genes Using Hierarchical Text Categorization ,(2005)
Carolyn E. Lipscomb, Medical Subject Headings (MeSH) Bulletin of The Medical Library Association. ,vol. 88, pp. 265- 266 ,(2000)
A. T. McCray, Extending a natural language parser with UMLS knowledge. annual symposium on computer application in medical care. pp. 194- 198 ,(1991)
Razi A, McCray At, Browne Ac, Aronson Ar, Srinivasan S, Rindflesch Tc, UMLS knowledge for biomedical language processing. Bulletin of The Medical Library Association. ,vol. 81, pp. 184- 194 ,(1993)
Thomas C. Rindflesch, Alan R. Aronson, Query expansion using the UMLS Metathesaurus. conference of american medical informatics association. pp. 485- 489 ,(1997)
Ellen Riloff, Rosie Jones, Learning dictionaries for information extraction by multi-level bootstrapping national conference on artificial intelligence. pp. 474- 479 ,(1999)
Barbara Ann Kipfer, 21st Century Synonym and Antonym Finder ,(1995)