Efficient mining of discriminative co-clusters from gene expression data

作者: Omar Odibat , Chandan K. Reddy

DOI: 10.1007/S10115-013-0684-0

关键词:

摘要: Discriminative models are used to analyze the differences between two classes and identify class-specific patterns. Most of existing discriminative depend on using entire feature space compute patterns for each class. Co-clustering has been proposed capture that correlated in a subset features, but it cannot handle labeled datasets. In certain biological applications such as gene expression analysis, is critical consider only space. The objective this paper twofold: first, presents an algorithm efficiently find arbitrarily positioned co-clusters from complex data. Second, extends co-clustering discover by incorporating class information into co-cluster search process. addition, we also characterize propose three novel measures can be evaluate performance any subspace pattern-mining algorithm. We evaluated algorithms several synthetic real datasets, our experimental results showed outperformed available literature.

参考文章(41)
Ruggero G. Pensa, Jean-François Boulicaut, Constrained Co-clustering of Gene Expression Data siam international conference on data mining. pp. 25- 36 ,(2008)
John Skilling, S. F. Gull, Algorithms and Applications Springer, Dordrecht. pp. 83- 132 ,(1985) , 10.1007/978-94-017-2221-6_5
Gilles Bisson, Syed Fawad Hussain, Text Categorization Using Word Similarities Based on Higher Order Co-occurrences. siam international conference on data mining. pp. 1- 12 ,(2010)
George M. Church, Yizong Cheng, Biclustering of Expression Data intelligent systems in molecular biology. ,vol. 8, pp. 93- 103 ,(2000)
Arindam Banerjee, Hanhuai Shan, Residual Bayesian co-clustering for matrix approximation siam international conference on data mining. pp. 223- 234 ,(2010)
Yangqiu Song, Shimei Pan, Shixia Liu, Weihong Qian, Furu Wei, Michelle X. Zhou, Constrained co-clustering for textual documents national conference on artificial intelligence. pp. 581- 586 ,(2010)
Mohammad S. Aziz, Chandan K. Reddy, A robust seedless algorithm for correlation clustering knowledge discovery and data mining. pp. 28- 37 ,(2010) , 10.1007/978-3-642-13657-3_6
Tobey J. MacDonald, Kevin M. Brown, Bonnie LaFleur, Katia Peterson, Christopher Lawlor, Yidong Chen, Roger J. Packer, Philip Cogen, Dietrich A. Stephan, Expression profiling of medulloblastoma: PDGFRA and the RAS/MAPK pathway as therapeutic targets for metastatic disease Nature Genetics. ,vol. 29, pp. 143- 152 ,(2001) , 10.1038/NG731
George Karypis, Michael Steinbach, Vipin Kumar, A Comparison of Document Clustering Techniques ,(2000)