Clustering files of chemical structures using the Székely–Rizzo generalization of Ward's method

作者: Thibault Varin , Ronan Bureau , Christoph Mueller , Peter Willett

DOI: 10.1016/J.JMGM.2009.06.006

关键词:

摘要: Ward's method is extensively used for clustering chemical structures represented by 2D fingerprints. This paper compares Ward clusterings of 14 datasets (containing between 278 and 4332 molecules) with those obtained using the Szekely–Rizzo method, a generalization method. The clusters resulting from these two methods were evaluated extent to which various classifications able group active molecules together, novel criterion effectiveness. Analysis total 1400 (Ward methods, different datasets, 5 fingerprints 10 distance coefficients) demonstrated general superiority coefficient first described Soergel performed extremely well in experiments, this was also case when it simulated virtual screening experiments.

参考文章(37)
G. N. Lance, W. T. Williams, Mixed-Data Classificatory Programs I - Agglomerative Systems. Australian Computer Journal. ,vol. 1, pp. 15- 20 ,(1967)
Geoff M. Downs, John M. Barnard, Clustering Methods and Their Uses in Computational Chemistry Reviews in Computational Chemistry, Volume 18. pp. 1- 40 ,(2003) , 10.1002/0471433519.CH1
Uli Fechner, Gisbert Schneider, Evaluation of distance metrics for ligand-based similarity searching. ChemBioChem. ,vol. 5, pp. 538- 540 ,(2004) , 10.1002/CBIC.200300812
Andreas Bender, Jeremy L. Jenkins, Josef Scheiber, Sai Chetan K. Sukuru, Meir Glick, John W. Davies, How similar are similarity searching methods? A principal component analysis of molecular descriptor space Journal of Chemical Information and Modeling. ,vol. 49, pp. 108- 119 ,(2009) , 10.1021/CI800249S
Peter Willett, Similarity-based virtual screening using 2D fingerprints Drug Discovery Today. ,vol. 11, pp. 1046- 1053 ,(2006) , 10.1016/J.DRUDIS.2006.10.005
Frank Critchley, Willem Heiser, Hierarchical trees can be perfectly scaled in one dimension Journal of Classification. ,vol. 5, pp. 5- 20 ,(1988) , 10.1007/BF01901668
Thibault Varin, Nicolas Saettel, Jonathan Villain, Aurelien Lesnard, François Dauphin, Ronan Bureau, Sylvain Rault, 3D Pharmacophore, hierarchical methods, and 5-HT4 receptor binding data Journal of Enzyme Inhibition and Medicinal Chemistry. ,vol. 23, pp. 593- 603 ,(2008) , 10.1080/14756360802204748
Geoffrey M. Downs, Peter Willett, William Fisanick, Similarity Searching and Clustering of Chemical-Structure Databases Using Molecular Property Data Journal of Chemical Information and Computer Sciences. ,vol. 34, pp. 1094- 1102 ,(1994) , 10.1021/CI00021A011
Jérôme Hert, Peter Willett, David J. Wilton, Pierre Acklin, Kamal Azzaoui, Edgar Jacoby, Ansgar Schuffenhauer, Comparison of fingerprint-based methods for virtual screening using multiple bioactive reference structures. Journal of Chemical Information and Computer Sciences. ,vol. 44, pp. 1177- 1185 ,(2004) , 10.1021/CI034231B