Extreme Logistic Regression: A Large Scale Learning Algorithm with Application to Prostate Cancer Mortality Prediction

Authors: Janusz Wojtusiak, Talha Oz, Che Ngufor, Andrea Hooker, Jack Hadley

Abstract: With the recent popularity of electronic medical records, an enormous amount of data is being generated every day at an exponential rate. Machine learning methods have been shown in many studies to be capable of producing automatic diagnostic models such as automated prognostic models. However, powerful machine learning algorithms such as support vector machine (SVM), Random Forest (RF) or Kernel Logistic Regression (KLR) are unbearably slow for very large datasets. This limits their use in research to small and medium scale problems. This study is motivated by ongoing research on prostate cancer mortality prediction in a nationally representative US population, where SVM and RF took several hours or days to train whereas simple linear methods such as logistic regression or discriminant analysis take minutes or even seconds. Because most real-world problems are non-linear, this paper presents an algorithm enabling the recently proposed least squares extreme logistic regression to learn from very large datasets. In the case study of men diagnosed with early stage prostate cancer, the algorithm is shown to provide fast and more accurate results than standard statistical methods.
