Algorithms for Internal Validation Clustering Measures in the Post Genomic Era

作者: Filippo Utro

DOI:

关键词:

摘要: Inferring cluster structure in microarray datasets is a fundamental task for the -omic sciences. A question Statistics, Data Analysis and Classification, prediction of number clusters dataset, usually established via internal validation measures. Despite wealth measures available literature, new ones have been recently proposed, some them specifically data. In this dissertation, study given, paying particular attention to stability based ones. Indeed, class particularly prominent promising order reliable estimate dataset. For those measures, general algorithmic paradigm proposed here that highlights richness accounts already literature. Moreover, most representative are also considered. Experiments on 12 benchmark performed assess both intrinsic ability measure predict correct dataset its merit relative other The main result hierarchy terms precision speed, highlighting their merits limitations not reported before This shows faster measure, less accurate it is. reduce time performance gap between fastest precise technique designing fast approximation algorithms systematically applied. end speed-up many studied brings within one magnitude time, with no degradation power. Prior work, was at least two orders magnitude.

参考文章(151)
Swagatam Das, Sambarta Dasgupta, Arijit Biswas, Ajith Abraham, Amit Konar, None, On Stability of the Chemotactic Dynamics in Bacterial-Foraging Optimization Algorithm systems man and cybernetics. ,vol. 39, pp. 670- 679 ,(2009) , 10.1109/TSMCA.2008.2011474
Shun-ichi Amari, Andrzej Cichocki, Adaptive Blind Signal and Image Processing: Learning Algorithms and Applications John Wiley & Sons, Inc.. ,(2002)
Purushottam Kar, Manjish Pal, Arnab Bhattacharya, On Low Distortion Embeddings of Statistical Distance Measures into Low Dimensional Spaces database and expert systems applications. ,vol. 5690, pp. 164- 172 ,(2009) , 10.1007/978-3-642-03573-9_13
A. D. Gordon, Null Models in Cluster Validation Springer, Berlin, Heidelberg. pp. 32- 44 ,(1996) , 10.1007/978-3-642-79999-0_3
Guoli Wang, Andrew V Kossenkov, Michael F Ochs, LS-NMF: A modified non-negative matrix factorization algorithm utilizing uncertainty estimates BMC Bioinformatics. ,vol. 7, pp. 175- 175 ,(2006) , 10.1186/1471-2105-7-175
Genomic Signal Processing and Statistics Hindawi Publishing Corporation. ,(2005) , 10.1155/9789775945075
Allan D. Gordon, Clustering Algorithms and Cluster Validation Physica-Verlag HD. pp. 497- 512 ,(1994) , 10.1007/978-3-642-57991-2_29
Brian S. Everitt, Sabine Landau, Morven Leese, Cluster Analysis ,(1974)
D. Middleton, John A. Rice, Mathematical Statistics and Data Analysis The Mathematical Gazette. ,vol. 72, pp. 330- 331 ,(1988) , 10.2307/3619963