Facing the Challenges of Data Integration in Biosciences

作者: Aiguo Li

DOI:

关键词: Information extractionStrengths and weaknessesData miningBiological databaseClinical scienceBiologyBiological dataSystems biologyData scienceData integration

摘要: Data integration in molecular biology and clinical science has become imperative for providing the comprehensive information extraction systems biology. In this review we evaluate evolution characteristics of biological databases examine existing approaches to data bioscience. Strengths weaknesses these are identified by surveying several successful examples integration. We point out challenges faced possible solutions on various levels while contrasting efforts biosciences with those industry. Index Terms integration, federation, warehouse,

参考文章(36)
Rolf Apweiler, Manuela Pruess, Paul J. Kersey, The Integr8 project--a resource for genomic and proteomic data. in Silico Biology. ,vol. 5, pp. 179- 185 ,(2005)
Paul T Spellman, Michael Miller, Jason Stewart, Charles Troup, Ugis Sarkans, Steve Chervitz, Derek Bernhart, Gavin Sherlock, Catherine Ball, Marc Lepage, Marcin Swiatek, WL Marks, Jason Goncalves, Scott Markel, Daniel Iordan, Mohammadreza Shojatalab, Angel Pizarro, Joe White, Robert Hubley, Eric Deutsch, Martin Senger, Bruce J Aronow, Alan Robinson, Doug Bassett, Christian J Stoeckert, Alvis Brazma, Design and implementation of microarray gene expression markup language (MAGE-ML) Genome Biology. ,vol. 3, pp. 1- 9 ,(2002) , 10.1186/GB-2002-3-9-RESEARCH0046
Yoshikazu Hasegawa, Motoaki Seki, Yoshiki Mochizuki, Naohiko Heida, Katsura Hirosawa, Naoki Okamoto, Tetsuya Sakurai, Masakazu Satou, Kenji Akiyama, Kei Iida, Kisik Lee, Shigehiko Kanaya, Taku Demura, Kazuo Shinozaki, Akihiko Konagaya, Tetsuro Toyoda, None, A flexible representation of omic knowledge for thorough analysis of microarray data Plant Methods. ,vol. 2, pp. 5- 5 ,(2006) , 10.1186/1746-4811-2-5
Rakesh Nagarajan, Mushtaq Ahmed, Aditya Phatak, Database challenges in the integration of biomedical data sets very large data bases. pp. 1202- 1213 ,(2004) , 10.1016/B978-012088469-8.50107-8
Claudia Imhoff, Ryan Sousa, William H. Inmon, Corporate Information Factory ,(1998)
Hui Ge, Albertha J.M Walhout, Marc Vidal, Integrating 'omic' information: a bridge between genomics and systems biology. Trends in Genetics. ,vol. 19, pp. 551- 560 ,(2003) , 10.1016/J.TIG.2003.08.009
R. Stanislaus, C. Chen, J. Franklin, J. Arthur, J. S. Almeida, AGML Central: web based gel proteomic infrastructure Bioinformatics. ,vol. 21, pp. 1754- 1757 ,(2005) , 10.1093/BIOINFORMATICS/BTI246
S. B. Davidson, J. Crabtree, B. P. Brunk, J. Schug, V. Tannen, G. C. Overton, C. J. Stoeckert, K2/Kleisli and GUS: experiments in integrated access to genomic data sources Ibm Systems Journal. ,vol. 40, pp. 512- 531 ,(2001) , 10.1147/SJ.402.0512
I-Min A. Chen, Victor M. Markowitz, An overview of the object protocol model (OPM) and the OPM data management tools Information Systems. ,vol. 20, pp. 393- 418 ,(1995) , 10.1016/0306-4379(95)00021-U