Complementarity between public and commercial databases: new opportunities in medicinal chemistry informatics.

作者: Christopher Southan , Peter Varkonyi , Sorel Muresan

DOI: 10.2174/156802607782194761

关键词: Pairwise comparisonDatabaseDownloadData collectionPubChemCheminformaticsComputer scienceData scienceCommercial SourcesInformaticsBiological data

摘要: The last two years have seen a dramatic expansion in public cheminformatics, as exemplified by the approximate five-fold growth of PubChem from over 50 contributing data sources. Consequently, medicinal chemists who were hitherto limited to commercial databases now also access sources that they can download and/or query directly Web. range sources, particularly where link out structured bioinformatic and biological data, already offer utilities no equivalent. This work reviews compound content comparisons between selected capture bioactive content. We focused on those specify relationships compounds their protein targets. Our stringent filtering produced lower unique numbers than reported for individual thereby facilitated standardised resultant matrix shows pairwise comparison each database subsets. Overall, this showed an unexpected degree non-overlap, emphasising complementarity gained combining conclusion is supported Venn-type analysis GVKBIO, WOMBAT (both commercial) (public). These show not only overlap but case because different strategies source selection collection.

参考文章(31)
Sherri L Rogalski, Dragos Horvath, Boryeu Mao, Cecile M Krejsa, Jacques C Migeon, Frédérique Barbosa, Julie E Penzotti, Predicting ADME properties and side effects: the BioPrint approach. Current Opinion in Drug Discovery & Development. ,vol. 6, pp. 470- 480 ,(2003)
G. Divita, J. G. Mork, A. C. Browne, A. R. Aronson, G. F. Hazard, W. J. Wilbur, Analysis of biomedical text for chemical names: a comparison of three methods. american medical informatics association annual symposium. pp. 176- 180 ,(1999)
Marius Olah, Ramona Rad, Liliana Ostopovici, Alina Bora, Nicoleta Hadaruga, Dan Hadaruga, Ramona Moldovan, Adriana Fulias, Maria Mractc, Tudor I. Oprea, WOMBAT and WOMBAT‐PK: Bioactivity Databases for Lead and Drug Discovery John Wiley & Sons, Ltd. pp. 760- 786 ,(2008) , 10.1002/9783527619375.CH13B
Ramaswamy Nilakantan, Norman Bauman, Kevin S. Haraki, Database diversity assessment: new ideas, concepts, and tools. Journal of Computer-aided Molecular Design. ,vol. 11, pp. 447- 452 ,(1997) , 10.1023/A:1007937308615
Peter W. Kenny, Jens Sadowski, Structure Modification in Chemical Databases Wiley‐VCH Verlag GmbH & Co. KGaA. pp. 271- 285 ,(2005) , 10.1002/3527603743.CH11
Christopher P Austin, Linda S Brady, Thomas R Insel, Francis S Collins, NIH Molecular Libraries Initiative. Science. ,vol. 306, pp. 1138- 1139 ,(2004) , 10.1126/SCIENCE.1105511
Kevin A Snyder, Howard J Feldman, Michel Dumontier, John J Salama, Christopher WV Hogue, Domain-based small molecule binding site annotation BMC Bioinformatics. ,vol. 7, pp. 152- 152 ,(2006) , 10.1186/1471-2105-7-152
Peter Murray-Rust, John Mitchell, Henry Rzepa, Chemistry in bioinformatics. BMC Bioinformatics. ,vol. 6, pp. 141- 141 ,(2005) , 10.1186/1471-2105-6-141
David S. Wishart, Bioinformatics in Drug Development and Assessment Drug Metabolism Reviews. ,vol. 37, pp. 279- 310 ,(2005) , 10.1081/DMR-55225
Gaia V Paolini, Richard H B Shapland, Willem P van Hoorn, Jonathan S Mason, Andrew L Hopkins, Global mapping of pharmacological space Nature Biotechnology. ,vol. 24, pp. 805- 815 ,(2006) , 10.1038/NBT1228