Automated Extraction of Protein Mutation Impacts from the Biomedical Literature

作者: Nona Naderi

DOI:

关键词:

摘要: Mutations as sources of evolution have long been the focus attention in the biomedical literature. Accessing mutational information and their impacts on protein properties facilitates research various domains, such as enzymology pharmacology. However, manually reading through rich fast growing repository of biomedical literature is expensive time-consuming. A number curated databases, BRENDA (http://www.brenda-enzymes.org), try to index provide this information; yet provided data seems be incomplete. Thus, there a growing need for automated approaches extract this information. In work, we present a system automatically summarize impact information from mutations. Our extraction module split into subtasks: organism analysis, mutation detection, property impact analysis. Organisms, proteins, are required extracted to help disambiguation genes proteins. our extracts and grounds organisms NCBI. We detect mutation series correctly ground detected impacts. Our also affected well magnitude the effects. The output populated an OWL-DL ontology, which can then queried structured information. The performance evaluated on both external internal corpora and databases. results show reliability approaches. Organism achieves precision recall 95% and 94% grounding accuracy 97.5% OT corpus. On manually annotated corpus Linneaus-100, of 99% 97% with 97.4%. In impact detection task, of 70.4%-71.8% 71.2%-71.3% annotated documents. grounds detected impacts 70.1%-71.7% documents and 57%-57.5% 82.5%-84.2% against data.

参考文章(46)
Arash Shaban-Nejad, Volker Haarslev, René Witte, Greg Butler, Christopher J.O. Baker, The FungalWeb Ontology: Application Scenarios Eighth Annual Bio-Ontologies Meeting. ,vol. 4364, ,(2005)
Xinglong Wang, Claire Grover, Learning the Species of Biomedical Named Entities from Annotated Corpora language resources and evaluation. ,(2008)
Ian Horrocks, Christine Golbreich, The OBO to OWL Mapping, GO to OWL 1.1! owl: experiences and directions. ,(2007)
René Witte, Thomas Kappler, Christopher J. O. Baker, Ontology Design for Biomedical Text Mining Semantic Web: Revolutionizing Knowledge Discovery in the Life Sciences. ,vol. 25, pp. 281- 313 ,(2007) , 10.1007/978-0-387-48438-9_14
René Witte, Ninus Khamis, Juergen Rilling, Flexible Ontology Population from Text: The OwlExporter language resources and evaluation. ,(2010)
Xinglong Wang, Rule-Based Protein Term Identification with Help from Automatic Species Tagging Computational Linguistics and Intelligent Text Processing. pp. 288- 298 ,(2007) , 10.1007/978-3-540-70939-8_26
Frank van van Harmelen, Grigoris Antoniou, Rinke Hoekstra, Paul Groth, A semantic Web primer ,(2004)
Wolfram Liebermeister, Edda Klipp, Simon Borger, Prediction of enzyme kinetic parameters based on statistical learning. Genome Informatics. ,vol. 17, pp. 80- 87 ,(2006) , 10.11234/GI1990.17.80