摘要: In gene expression microarray data analysis, selecting a small number of discriminative genes from thousands is an important problem for accurate classification diseases or phenotypes. The becomes particularly challenging due to the large features (genes) and sample size. Traditional selection methods often select top-ranked according their individual power without handling high degree redundancy among genes. Latest research shows that removing redundant selected ones can achieve better representation characteristics targeted phenotypes lead improved accuracy. Hence, we study in this paper relationship between feature relevance propose efficient method effectively remove efficiency effectiveness our comparison with representative has been demonstrated through empirical using public sets.