A New Unsupervised Feature Ranking Method for Gene Expression Data Based on Consensus Affinity

Authors: Shaohong Zhang, Hau-San Wong, Ying Shen, Dongqing Xie

DOI: 10.1109/TCBB.2012.34

Abstract: Feature selection is widely established as one of the fundamental computational techniques in mining microarray data. Due to the lack of categorized information in practice, unsupervised feature selection is more practically important but correspondingly more difficult. Motivated by cluster ensemble techniques, which combine multiple clustering solutions into a consensus solution of higher accuracy and stability, recent efforts in unsupervised feature selection proposed to use these consensus solutions as oracles. However, these methods are dependent on both the particular cluster ensemble algorithm used and the knowledge of the true cluster number. These methods will be unsuitable when the true cluster number is not available, which is common in practice. In view of the above problems, a new unsupervised feature ranking method is proposed to evaluate the importance of the features based on consensus affinity. Different from previous works, our method compares the corresponding affinity of each feature between a pair of instances based on the consensus matrix of clustering solutions. As a result, our method alleviates the need to know the true number of clusters and the dependence on particular cluster ensemble approaches as in previous works. Experiments on real gene expression data sets demonstrate significant improvement of the feature ranking results when compared to several state-of-the-art techniques.
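
The abstract describes the idea only in words: build a consensus matrix from many clustering solutions, then judge each feature by comparing its own affinity between pairs of instances with the consensus affinity of those pairs. The following Python sketch illustrates that general idea under several assumptions that are not taken from the paper: k-means with varying cluster numbers is used as the base clusterer, the per-feature affinity is one minus the range-normalised absolute difference, and the score is the Pearson correlation between the two affinities. The function names `kmeans_labels`, `consensus_matrix`, and `rank_features` are hypothetical, and the data are synthetic.

```python
import numpy as np


def kmeans_labels(X, k, rng, n_iter=20):
    """Plain Lloyd's k-means; returns one cluster label per instance."""
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # Squared Euclidean distance from every instance to every center.
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return labels


def consensus_matrix(X, ks=(2, 3, 4, 5), runs_per_k=5, seed=0):
    """Fraction of clustering solutions that put instances i and j together.

    Varying k (an assumption of this sketch) avoids committing to one
    cluster number.
    """
    rng = np.random.default_rng(seed)
    n = len(X)
    consensus = np.zeros((n, n))
    n_solutions = 0
    for k in ks:
        for _ in range(runs_per_k):
            labels = kmeans_labels(X, k, rng)
            consensus += labels[:, None] == labels[None, :]
            n_solutions += 1
    return consensus / n_solutions


def rank_features(X, consensus):
    """Rank features by how well their pairwise affinity tracks the consensus.

    The affinity definition and the correlation score are assumptions of
    this sketch, not the scoring rule of the paper.
    """
    n, d = X.shape
    iu = np.triu_indices(n, k=1)  # each unordered instance pair once
    target = consensus[iu]
    scores = np.zeros(d)
    for f in range(d):
        col = X[:, f]
        spread = col.max() - col.min()
        if spread == 0:
            continue  # a constant feature carries no affinity information
        affinity = 1.0 - np.abs(col[:, None] - col[None, :])[iu] / spread
        scores[f] = np.corrcoef(affinity, target)[0, 1]
    return np.argsort(-scores), scores


# Synthetic data: 60 instances, 10 features; only features 0 and 1
# carry the (hidden) three-group structure.
data_rng = np.random.default_rng(42)
groups = np.repeat(np.arange(3), 20)
X = data_rng.normal(size=(60, 10))
X[:, :2] += 4.0 * groups[:, None]

C = consensus_matrix(X)
order, scores = rank_features(X, C)
print("features from most to least important:", order)
```

Because the score only needs the consensus matrix, no final consensus partition and no single cluster number has to be chosen, which is the property the abstract emphasises.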

References (41)
John Quackenbush, Computational analysis of microarray data. Nature Reviews Genetics, vol. 2, pp. 418-427 (2001), DOI: 10.1038/35076576
Mark Andrew Hall, Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning. International Conference on Machine Learning, pp. 359-366 (2000)
Huan Liu, Lei Yu, Feature selection for high-dimensional data: a fast correlation-based filter solution. International Conference on Machine Learning, pp. 856-863 (2003)
Xiaoli Zhang Fern, Carla E. Brodley, Solving cluster ensemble problems by bipartite graph partitioning. Twenty-first International Conference on Machine Learning - ICML '04, pp. 36- (2004), DOI: 10.1145/1015330.1015414
Javed A. Aslam, Mark Montague, Models for metasearch. International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 276-284 (2001), DOI: 10.1145/383952.384007
Yi Hong, Sam Kwong, Yuchou Chang, Qingsheng Ren, Consensus unsupervised feature ranking from multiple views. Pattern Recognition Letters, vol. 29, pp. 595-602 (2008), DOI: 10.1016/J.PATREC.2007.11.012
Ron Kohavi, George H. John, Wrappers for feature subset selection. Artificial Intelligence, vol. 97, pp. 273-324 (1997), DOI: 10.1016/S0004-3702(97)00043-X
Nicola J. Armstrong, Mark A. van de Wiel, Microarray data analysis: From hypotheses to conclusions using gene expression data. Cellular Oncology, vol. 26, pp. 279-290 (2004), DOI: 10.1155/2004/943940
William M. Rand, Objective Criteria for the Evaluation of Clustering Methods. Journal of the American Statistical Association, vol. 66, pp. 846-850 (1971), DOI: 10.1080/01621459.1971.10482356