Construction of protein semantic networks using PubMed/MEDLINE

作者: E. A. Ponomarenko , A. V. Lisitsa , E. V. Il’gisonis , A. I. Archakov

DOI: 10.1134/S0026893310010176

关键词: Construct (python library)Human proteinsBase sequenceMEDLINEBioinformaticsContext (language use)Natural language processingArtificial intelligenceEdge densityComputer scienceSemantic network

摘要: A method for constructing protein semantic networks using MEDLINE abstracts is proposed. The publications retrieved by the context search names (relevant) and related were used. proposed based on estimation of connectivity between proteins. score was calculated as a function number relevant or papers found pair This used to construct network 150 human proteins belonging five different metabolic pathways. Analysis demonstrated that involved in associated molecular processes formed subgraphs with high edge density.

参考文章(15)
Tim Beissbarth, Interpreting experimental results using gene ontologies. Methods in Enzymology. ,vol. 411, pp. 340- 352 ,(2006) , 10.1016/S0076-6879(06)11018-6
Yang Wang, Philip A. Marsden, Nitric oxide synthases: gene structure and regulation. Advances in pharmacology (San Diego). ,vol. 34, pp. 71- 90 ,(1995) , 10.1016/S1054-3589(08)61081-9
Gene Ontology Consortium, None, The Gene Ontology (GO) database and informatics resource Nucleic Acids Research. ,vol. 32, pp. 258D- 261 ,(2004) , 10.1093/NAR/GKH036
Gilbert S Omenn, David J States, Marcin Adamski, Thomas W Blackwell, Rajasree Menon, Henning Hermjakob, Rolf Apweiler, Brian B Haab, Richard J Simpson, James S Eddes, Eugene A Kapp, Robert L Moritz, Daniel W Chan, Alex J Rai, Arie Admon, Ruedi Aebersold, Jimmy Eng, William S Hancock, Stanley A Hefta, Helmut Meyer, Young‐Ki Paik, Jong‐Shin Yoo, Peipei Ping, Joel Pounds, Joshua Adkins, Xiaohong Qian, Rong Wang, Valerie Wasinger, Chi Yue Wu, Xiaohang Zhao, Rong Zeng, Alexander Archakov, Akira Tsugita, Ilan Beer, Akhilesh Pandey, Michael Pisano, Philip Andrews, Harald Tammen, David W Speicher, Samir M Hanash, None, Overview of the HUPO Plasma Proteome Project: results from the pilot phase with 35 collaborating laboratories and multiple analytical groups, generating a core dataset of 3020 proteins and a publicly-available database. Proteomics. ,vol. 5, pp. 3226- 3245 ,(2005) , 10.1002/PMIC.200500358
Satomi Nadanaka, Hiroshi Kitagawa, Heparan sulphate biosynthesis and disease. Journal of Biochemistry. ,vol. 144, pp. 7- 14 ,(2008) , 10.1093/JB/MVN040
D. J. Rogers, T. T. Tanimoto, A Computer Program for Classifying Plants Science. ,vol. 132, pp. 1115- 1118 ,(1960) , 10.1126/SCIENCE.132.3434.1115
Markus Bundschus, Mathaeus Dejori, Martin Stetter, Volker Tresp, Hans-Peter Kriegel, Extraction of semantic biomedical relations from text using conditional random fields BMC Bioinformatics. ,vol. 9, pp. 207- 207 ,(2008) , 10.1186/1471-2105-9-207
Jimmy Lin, W John Wilbur, PubMed related articles: a probabilistic topic-based model for content similarity. BMC Bioinformatics. ,vol. 8, pp. 423- 423 ,(2007) , 10.1186/1471-2105-8-423
Todd W Harris, Nansheng Chen, Fiona Cunningham, Marcela Tello‐Ruiz, Igor Antoshechkin, Carol Bastiani, Tamberlyn Bieri, Darin Blasiar, Keith Bradnam, Juancarlos Chan, Chao‐Kung Chen, Wen J Chen, Paul Davis, Eimear Kenny, Ranjana Kishore, Daniel Lawson, Raymond Lee, Hans‐Michael Muller, Cecilia Nakamura, Philip Ozersky, Andrei Petcherski, Anthony Rogers, Aniko Sabo, Erich M Schwarz, Kimberly Van Auken, Qinghua Wang, Richard Durbin, John Spieth, Paul W Sternberg, Lincoln D Stein, WormBase: a multi-species resource for nematode biology and genomics Nucleic Acids Research. ,vol. 32, pp. 411D- 417 ,(2004) , 10.1093/NAR/GKH066