Classification Using Generalized Partial Least Squares

作者: Beiying Ding , Robert Gentleman

DOI: 10.1198/106186005X47697

关键词:

摘要: Advances in computational biology have made simultaneous monitoring of thousands features possible. The high throughput technologies not only bring about a much richer information context which to study various aspects gene function, but they also present the challenge analyzing data with large number covariates and few samples. As an integral part machine learning, classification samples into two or more categories is almost always interest scientists. We address question this setting by extending partial least squares (PLS), popular dimension reduction tool chemometrics, generalized linear regression, based on previous approach, iteratively reweighted squares, that is, IRWPLS. compare our results two-stage PLS other classifiers. show phrasing problem model applying Firth's procedure avoid (quasi)separation, we often get lower classificatio...

参考文章(39)
David Firth, Bias reduction, the Jeffreys prior and GLIM Advances in GLIM and Statistical Modelling. pp. 91- 100 ,(1992) , 10.1007/978-1-4612-2952-0_15
W. N. Venables, B. D. Ripley, Modern Applied Statistics with S Springer. ,(2010) , 10.1007/978-0-387-21706-2
Peter McCullagh, John Ashworth Nelder, Generalized Linear Models ,(1983)
W.C. Knowler, R.S. Johannes, J.E. Everhart, W.C. Dickson, Jack W. Smith, Using the ADAP Learning Algorithm to Forecast the Onset of Diabetes Mellitus annual symposium on computer application in medical care. pp. 261- 265 ,(1988)
C. Yalçın Yıldırım, A note on ”() and ”’() Proceedings of the American Mathematical Society. ,vol. 124, pp. 2311- 2314 ,(1996) , 10.1090/S0002-9939-96-03755-0
Agnar Höskuldsson, PLS regression methods Journal of Chemometrics. ,vol. 2, pp. 211- 228 ,(1988) , 10.1002/CEM.1180020306
Sandrine Dudoit, Jane Fridlyand, Terence P Speed, None, Comparison of discrimination methods for the classification of tumors using gene expression data Journal of the American Statistical Association. ,vol. 97, pp. 77- 87 ,(2002) , 10.1198/016214502753479248
Paul H. C. Eilers, Judith M. Boer, Gert-Jan van Ommen, Hans C. van Houwelingen, Classification of microarray data with penalized logistic regression Microarrays : optical technologies and informatics. Conference. ,vol. 4266, pp. 187- 198 ,(2001) , 10.1117/12.427987
A. ALBERT, J. A. ANDERSON, On the existence of maximum likelihood estimates in logistic regression models Biometrika. ,vol. 71, pp. 1- 10 ,(1984) , 10.1093/BIOMET/71.1.1
Douglas T Ross, Uwe Scherf, Michael B Eisen, Charles M Perou, Christian Rees, Paul Spellman, Vishwanath Iyer, Stefanie S Jeffrey, Matt Van de Rijn, Mark Waltham, Alexander Pergamenschikov, JC Lee, Deval Lashkari, Dari Shalon, Timothy G Myers, John N Weinstein, David Botstein, Patrick O Brown, None, Systematic variation in gene expression patterns in human cancer cell lines. Nature Genetics. ,vol. 24, pp. 227- 235 ,(2000) , 10.1038/73432