Clustering of Largely Right-Censored Oropharyngeal Head and Neck Cancer Patients for Discriminative Groupings to Improve Outcome Prediction

作者: Joel Tosado , Luka Zdilar , Hesham Elhalawani , Baher Elgohari , David M Vock

DOI: 10.1038/S41598-020-60140-0

关键词:

摘要: Clustering is the task of identifying groups similar subjects according to certain criteria. The AJCC staging system can be thought as a clustering mechanism that patients based on their disease stage. This grouping drives prognosis and influences treatment. goal this work evaluate efficacy machine learning algorithms cluster into discriminative improve for overall survival (OS) relapse free (RFS) outcomes. We apply over retrospectively collected data from 644 head neck cancer including both clinical radiomic features. In order incorporate outcome information process deal with large proportion censored samples, feature space was scaled using regression coefficients fitted proxy dependent variable, martingale residuals, instead follow-up time. Two clusters were identified evaluated cross validation. Kaplan Meier (KM) curves between two differ significantly OS RFS (p-value < 0.0001). Moreover, there relative predictive improvement when label in addition features compared only where AUC increased by 5.7% 13.0% RFS, respectively.

参考文章(44)
Lola Rahib, Benjamin D. Smith, Rhonda Aizenberg, Allison B. Rosenzweig, Julie M. Fleshman, Lynn M. Matrisian, Projecting Cancer Incidence and Deaths to 2030: The Unexpected Burden of Thyroid, Liver, and Pancreas Cancers in the United States Cancer Research. ,vol. 74, pp. 2913- 2921 ,(2014) , 10.1158/0008-5472.CAN-14-0155
Thomas A. Gerds, Martin Schumacher, Consistent Estimation of the Expected Brier Score in General Survival Models with Right-Censored Event Times Biometrical Journal. ,vol. 48, pp. 1029- 1040 ,(2006) , 10.1002/BIMJ.200610301
Philip S. Maclin, Jack Dempsey, Jay Brooks, John Rand, Using neural networks to diagnose cancer Journal of Medical Systems. ,vol. 15, pp. 11- 19 ,(1991) , 10.1007/BF00993877
Hemant Ishwaran, Udaya B. Kogalur, Eugene H. Blackstone, Michael S. Lauer, Random survival forests The Annals of Applied Statistics. ,vol. 2, pp. 841- 860 ,(2008) , 10.1214/08-AOAS169
R.M. Haralick, Statistical and structural approaches to texture Proceedings of the IEEE. ,vol. 67, pp. 786- 804 ,(1979) , 10.1109/PROC.1979.11328
T. M. THERNEAU, P. M. GRAMBSCH, T. R. FLEMING, Martingale-based residuals for survival models Biometrika. ,vol. 77, pp. 147- 160 ,(1990) , 10.1093/BIOMET/77.1.147
Friedrich Leisch, A toolbox for K-centroids cluster analysis Computational Statistics & Data Analysis. ,vol. 51, pp. 526- 544 ,(2006) , 10.1016/J.CSDA.2005.10.006
David Arthur, Sergei Vassilvitskii, k-means++: the advantages of careful seeding symposium on discrete algorithms. pp. 1027- 1035 ,(2007) , 10.5555/1283383.1283494