Semi-Supervised Projective Non-Negative Matrix Factorization for Cancer Classification

作者: Xiang Zhang , Naiyang Guan , Zhilong Jia , Xiaogang Qiu , Zhigang Luo

DOI: 10.1371/JOURNAL.PONE.0138814

关键词:

摘要: Advances in DNA microarray technologies have made gene expression profiles a significant candidate identifying different types of cancers. Traditional learning-based cancer identification methods utilize labeled samples to train classifier, but they are inconvenient for practical application because labels quite expensive the clinical research community. This paper proposes semi-supervised projective non-negative matrix factorization method (Semi-PNMF) learn an effective classifier from both and unlabeled samples, thus boosting subsequent classification performance. In particular, Semi-PNMF jointly learns subspace concatenated indicates classes by positions maximum entries their coefficients. Because incorporates statistical information large volume learned subspace, it can more representative subspaces boost We developed multiplicative update rule (MUR) optimize proved its convergence. The experimental results two multiclass profile datasets show that outperforms methods.

参考文章(42)
R. Cowell Z. Ghahramani, A Zien, O Chapelle, Semi-Supervised Classification by Low Density Separation international conference on artificial intelligence and statistics. pp. 57- 64 ,(2005)
He Zhang, Zhirong Yang, Erkki Oja, Adaptive multiplicative updates for projective nonnegative matrix factorization international conference on neural information processing. pp. 277- 284 ,(2012) , 10.1007/978-3-642-34487-9_34
Lawrence K. Saul, Youngmin Cho, Nonnegative Matrix Factorization for Semi-supervised Dimensionality Reduction arXiv: Learning. ,(2011)
Daniel D. Lee, H. Sebastian Seung, Learning the parts of objects by non-negative matrix factorization Nature. ,vol. 401, pp. 788- 791 ,(1999) , 10.1038/44565
Matan Hofree, John P Shen, Hannah Carter, Andrew Gross, Trey Ideker, Network based stratification of tumor mutations Nature Methods. ,vol. 10, pp. 1108- 1115 ,(2015) , 10.1038/NMETH.2651
Yongxi Tan, Leming Shi, Weida Tong, G.T. Gene Hwang, Charles Wang, Multi-class tumor classification by discriminant partial least squares using microarray gene expression data and assessment of classification models Computational Biology and Chemistry. ,vol. 28, pp. 235- 243 ,(2004) , 10.1016/J.COMPBIOLCHEM.2004.05.002
Behrouz Madahian, Lih Y Deng, Ramin Homayouni, None, Application of Sparse Bayesian Generalized Linear Model to Gene Expression Data for Classification of Prostate Cancer Subtypes Open Journal of Statistics. ,vol. 2014, pp. 518- 526 ,(2014) , 10.4236/OJS.2014.47049
U. Maulik, A. Mukhopadhyay, D. Chakraborty, Gene-Expression-Based Cancer Subtypes Prediction Through Feature Selection and Transductive SVM IEEE Transactions on Biomedical Engineering. ,vol. 60, pp. 1111- 1117 ,(2013) , 10.1109/TBME.2012.2225622
Naiyang Guan, Xuhui Huang, Long Lan, Zhigang Luo, Xiang Zhang, Graph Based Semi-supervised Non-negative Matrix Factorization for Document Clustering international conference on machine learning and applications. ,vol. 1, pp. 404- 408 ,(2012) , 10.1109/ICMLA.2012.73
Clare M. Lee, Manikhandan A. V. Mudaliar, D. R. Haggart, C. Roland Wolf, Gino Miele, J. Keith Vass, Desmond J. Higham, Daniel Crowther, Simultaneous Non-Negative Matrix Factorization for Multiple Large Scale Gene Expression Datasets in Toxicology PLoS ONE. ,vol. 7, pp. e48238- ,(2012) , 10.1371/JOURNAL.PONE.0048238