Application of fuzzy clustering analysis to compound datasets for drug lead identification

作者: Sinarwati Mohamad Suhaili , Mohamad Nazim Jambli , Abdul Rahman Mat

DOI: 10.1109/ICCISCI.2012.6297272

关键词:

摘要: Recently, the increasing number of chemical compound datasets to be screened has been growing rapidly due fast developments high-throughput screening in drug discovery. These requires selection methods which have become one main technique discovery especially lead identification process. Thus, finding best method is needed pharmaceutical industry ensure accurate results this One most used cluster-based selection, involves subdividing a set compounds into clusters and choosing or small from each cluster. In non-overlapping such as Ward's, Group Average, Jarvis Patrick's K-means are preferred cluster diverse compounds. However, there little study on overlapping fuzzy c-mean (FCM) c-varieties (FCV) clustering algorithms. Therefore, these two algorithms applied their performance compared based effectiveness terms separation between actives inactives (Pa) different mean intercluster molecular dissimilarity (MIMDS). The analysis shows FCM gives compare FCV Pa indicating that promising use But, perform better than term MIMDS when higher fuzziness index value concerned.

参考文章(15)
Costel Sârbu, Jürgen W. Einax, Study of traffic-emitted lead pollution of soil and plants using different fuzzy clustering algorithms. Analytical and Bioanalytical Chemistry. ,vol. 390, pp. 1293- 1301 ,(2008) , 10.1007/S00216-007-1711-3
Robert Gunderson, James Watson, James C. Bezdek, Chris Coray, DETECTION AND CHARACTERIZATION OF CLUSTER SUBSTRUCTURE I. LINEAR STRUCTURE: FUZZY c-LINES* Siam Journal on Applied Mathematics. ,vol. 40, pp. 339- 357 ,(1981) , 10.1137/0140029
Jacek M. Łęski, Fuzzy c-varieties/elliptotypes clustering in reproducing kernel Hilbert space Fuzzy Sets and Systems. ,vol. 141, pp. 259- 280 ,(2004) , 10.1016/S0165-0114(03)00184-2
György Barkó, János Abonyi, József Hlavay, Application of fuzzy clustering and piezoelectric chemical sensor array for investigation on organic compounds Analytica Chimica Acta. ,vol. 398, pp. 219- 226 ,(1999) , 10.1016/S0003-2670(99)00377-3
Robert D. Brown, Yvonne C. Martin, Use of Structure−Activity Data To Compare Structure-Based Clustering Methods and Descriptors for Use in Compound Selection Journal of Chemical Information and Computer Sciences. ,vol. 36, pp. 572- 584 ,(1996) , 10.1021/CI9501047
D. J. Wild, C. J. Blankley, Comparison of 2D fingerprint types and hierarchy level selection methods for structural grouping using Ward's clustering Journal of Chemical Information and Computer Sciences. ,vol. 40, pp. 155- 162 ,(2000) , 10.1021/CI990086J
P. G. Dittmar, N. A. Farmer, W. Fisanick, R. C. Haines, J. Mockus, The CAS ONLINE search system. 1. General system design and selection, generation, and use of search screens Journal of Chemical Information and Computer Sciences. ,vol. 23, pp. 93- 102 ,(1983) , 10.1021/CI00039A002