MetMaxStruct: A Tversky-Similarity-Based Strategy for Analysing the (Sub)Structural Similarities of Drugs and Endogenous Metabolites

作者: Steve O'Hagan , Douglas B. Kell

DOI: 10.3389/FPHAR.2016.00266

关键词:

摘要: Background. Previous studies compared the molecular similarity of marketed drugs and endogenous human metabolites (endogenites), using a series fingerprint-type encodings, variously ranked clustered Tanimoto (Jaccard) coefficient (TS). Because this gives equal weight to all parts encoding (thence different substructures in molecule) it may not be optimal, since many cases molecule will bind their macromolecular targets. Unsupervised methods cannot alone uncover this. We here explore kinds differences that observed when TS is replaced – manner more equivalent semi-supervised learning by variants asymmetric Tversky (TV) similarity, includes   parameters. Results. Dramatic are (i) drug-endogenite heatmaps, (ii) cumulative ‘greatest similarity’ curves, (iii) fraction with metabolite exceeding given value parameters varied from values. The same true sum varied. A clear trend towards increased endogenite-likeness or adopt values nearer extremes range, smaller. molecules exhibiting greatest two interrogating drug (chlorpromazine clozapine) also vary both nature as for converse, interrogated an endogenite. library depends on contents library, ‘tuned’ accordingly, manner. At some discovery candidates natural products can “look” much like (i.e. have numerical closer to) than do even endogenites. Conclusions. Overall, metrics provide useful range examples does simpler help draw attention

参考文章(124)
Gerald M. Maggiora, Veerabahu Shanmugasundaram, Molecular Similarity Measures Methods in Molecular Biology. ,vol. 672, pp. 39- 100 ,(2011) , 10.1007/978-1-60761-839-3_2
D. B. Kell, P. Dobson, The cellular uptake of pharmaceutical drugs is mainly carrier-mediated and is thus an issue not so much of biophysics but of systems biology In: System Chemistry: Systems Chemistry; 26 May 2008-30 May 2008; Bozen, Italy. Frankfurt: Beilstein Institute; 2009. p. 149-168.. pp. 149- 168 ,(2009)
David Lowe, David S. Broomhead, Radial Basis Functions, Multi-Variable Functional Interpolation and Adaptive Networks Complex Systems. ,vol. 2, pp. 321- 355 ,(1988)
Steve O’Hagan, Douglas B. Kell, Software review: the KNIME workflow environment and its applications in genetic programming and machine learning Genetic Programming and Evolvable Machines. ,vol. 16, pp. 387- 391 ,(2015) , 10.1007/S10710-015-9247-3
José L. Medina-Franco, Gerald M. Maggiora, MOLECULAR SIMILARITY ANALYSIS Chemoinformatics for Drug Discovery. pp. 343- 399 ,(2013) , 10.1002/9781118742785.CH15
John J. Irwin, Using ZINC to Acquire a Virtual Screening Library Current protocols in human genetics. ,vol. 22, ,(2008) , 10.1002/0471250953.BI1406S22
Andreas Bender, Jeremy L. Jenkins, Qingliang Li, Sam E. Adams, Edward O. Cannon, Robert C. Glen, Chapter 9 Molecular Similarity: Advances in Methods, Applications and Validations in Virtual Screening and QSAR Annual Reports in Computational Chemistry. ,vol. 2, pp. 141- 168 ,(2006) , 10.1016/S1574-1400(06)02009-3
John J. Irwin, Teague Sterling, ZINC 15 – Ligand Discovery for Everyone Journal of Chemical Information and Modeling. ,vol. 55, pp. 2324- 2337 ,(2015) , 10.1021/ACS.JCIM.5B00559