Evolutionary soft co-clustering: formulations, algorithms, and applications

作者: Wenlu Zhang , Rongjian Li , Daming Feng , Andrey Chernikov , Nikos Chrisochoides

DOI: 10.1007/S10618-014-0375-9

关键词:

摘要: We consider the co-clustering of time-varying data using evolutionary methods. Existing approaches are based on spectral learning framework, thus lacking a probabilistic interpretation. overcome this limitation by developing model in paper. The proposed assumes that observed generated via two-step process depends historic co-clusters. This allows us to capture temporal smoothness probabilistically principled manner. To perform maximum likelihood parameter estimation, we present an EM-based algorithm. also establish convergence EM An appealing feature is it leads soft assignments naturally. evaluate method both synthetic and real-world sets. Experimental results show our consistently outperforms prior method. fully exploit impact methods, further systematic application study analysis Drosophila gene expression pattern images. encode spatial information at particular developmental time point into matrix mesh-generation pipeline. then co-cluster embryonic domains genes simultaneously for multiple points Results co-clusters reflect underlying biology.

参考文章(54)
Dacheng Tao, Jun Li, A Bayesian factorised covariance model for image analysis international joint conference on artificial intelligence. pp. 1465- 1471 ,(2013)
Fei Wang, Chenhao Tan, Ping Li, Arnd Christian König, Efficient document clustering via online nonnegative matrix factorizations siam international conference on data mining. pp. 908- 919 ,(2011)
Changshui Zhang, Fei Wang, Tao Li, Semi-Supervised Clustering via Matrix Factorization. siam international conference on data mining. pp. 1- 12 ,(2008)
George M. Church, Yizong Cheng, Biclustering of Expression Data intelligent systems in molecular biology. ,vol. 8, pp. 93- 103 ,(2000)
Volker Hartenstein, Atlas of Drosophila Development ,(1995)
Hanghang Tong, Spiros Papadimitriou, Philip S. Yu, Christos Faloutsos, Proximity tracking on time-evolving bipartite graphs siam international conference on data mining. pp. 704- 715 ,(2008) , 10.1137/1.9781611972788.64
Daniel D. Lee, H. Sebastian Seung, Learning the parts of objects by non-negative matrix factorization Nature. ,vol. 401, pp. 788- 791 ,(1999) , 10.1038/44565
Angelike Stathopoulos, Michael Levine, Genomic Regulatory Networks and Animal Development Developmental Cell. ,vol. 9, pp. 449- 462 ,(2005) , 10.1016/J.DEVCEL.2005.09.005
Oren E. Livne, Gene H. Golub, Scaling by Binormalization Numerical Algorithms. ,vol. 35, pp. 97- 120 ,(2004) , 10.1023/B:NUMA.0000016606.32820.69
Gene H. Golub, Charles F. Van Loan, Matrix computations (3rd ed.) Johns Hopkins University Press. ,(1996)