Efficient statistical tests to compare Youden index: accounting for contingency correlation

作者: Fangyao Chen , Yuqiang Xue , Ming T. Tan , Pingyan Chen

DOI: 10.1002/SIM.6432

关键词: MathematicsYouden's J statisticNominal levelStatisticsCohen's kappaEconometricsDelta methodContingency tableAccountingSample (statistics)Statistical inferenceStatistical hypothesis testing

摘要: Youden index is widely utilized in studies evaluating accuracy of diagnostic tests and performance predictive, prognostic, or risk models. However, both one two independent sample on have been derived ignoring the dependence (association) between sensitivity specificity, resulting potentially misleading findings. Besides, paired test currently unavailable. This article develops efficient statistical inference procedures for sample, independent, by accounting contingency correlation, namely associations specificity samples typically represented tables. For tests, variances are estimated Delta method, based central limit theory, which then verified bootstrap estimates. test, we show that covariance sensitivities specificities can be as a function kappa statistic so readily carried out. We remarkable variance using constrained optimization approach. Simulation performed to evaluate properties tests. The proposed approaches yield more stable type I errors at nominal level substantially higher power (efficiency) than does original Youden's Therefore, simple explicit large solution performs very well. Because implement asymptotic exact computation with common software like R, method broadly applicable evaluation model performance.

参考文章(37)
Robert F. Wagner, Sergey V. Beiden, Charles E. Metz, Continuous versus categorical data for ROC analysis: some quantitative considerations. Academic Radiology. ,vol. 8, pp. 328- 334 ,(2001) , 10.1016/S1076-6332(03)80502-0
Comparing the Accuracy of Two Diagnostic Tests John Wiley & Sons, Inc.. pp. 165- 192 ,(2008) , 10.1002/9780470317082.CH5
Gary W. Oehlert, A note on the delta method The American Statistician. ,vol. 46, pp. 27- 29 ,(1992) , 10.1080/00031305.1992.10475842
Liansheng Tang, Scott S Emerson, Xiao-Hua Zhou, None, Nonparametric and Semiparametric Group Sequential Methods for Comparing Accuracy of Diagnostic Tests Biometrics. ,vol. 64, pp. 1137- 1145 ,(2008) , 10.1111/J.1541-0420.2008.01000.X
J. Swets, Measuring the accuracy of diagnostic systems Science. ,vol. 240, pp. 1285- 1293 ,(1988) , 10.1126/SCIENCE.3287615
Brandy M Ringham, Todd A Alonzo, Gary K Grunwald, Deborah H Glueck, Estimates of sensitivity and specificity can be biased when reporting the results of the second test in a screening trial conducted in series BMC Medical Research Methodology. ,vol. 10, pp. 3- 3 ,(2010) , 10.1186/1471-2288-10-3
J Dinnes, J Deeks, J Kirby, P Roderick, A methodological review of how heterogeneity has been examined in systematic reviews of diagnostic test accuracy Health Technology Assessment. ,vol. 9, pp. 1- ,(2005) , 10.3310/HTA9120
S. D. Walter, A. R. Jadad, Meta-analysis of screening data: a survey of the literature Statistics in Medicine. ,vol. 18, pp. 3409- 3424 ,(1999) , 10.1002/(SICI)1097-0258(19991230)18:24<3409::AID-SIM377>3.0.CO;2-8
Charles E. Metz, David J. Goodenough, Kurt Rossmann, Evaluation of Receiver Operating Characteristic Curve Data in Terms of Information Theory, with Applications in Radiography Radiology. ,vol. 109, pp. 297- 303 ,(1973) , 10.1148/109.2.297