The Benefits of the Matthews Correlation Coefficient (MCC) Over the Diagnostic Odds Ratio (DOR) in Binary Classification Assessment

作者: Giuseppe Jurman , Davide Chicco , Valery Starovoitov

DOI: 10.1109/ACCESS.2021.3068614

关键词: Matthews correlation coefficientCorrelationConfusion matrixF1 scoreFalse positive paradoxContingency tableStatisticsDiagnostic odds ratioBinary classificationMathematics

摘要: To assess the quality of a binary classification, researchers often take advantage four-entry contingency table called confusion matrix , containing true positives, negatives, false and negatives. recap four values in unique score, statisticians have developed several rates metrics. In past, scientific studies already showed why Matthews correlation coefficient (MCC) is more informative trustworthy than confusion-entropy error, accuracy, F1 bookmaker informedness, markedness, balanced accuracy. this study, we compare MCC with diagnostic odds ratio (DOR), statistical rate employed sometimes biomedical sciences. After examining properties DOR, describe relationships between them, by also taking an innovative geometrical plot tetrahedron presented here for first time. We then report some use cases where DOR produce discordant outcomes, explain reliable two. Our results can strong impact computer science statistics, because they clearly trustworthiness information provided higher one generated ratio.

参考文章(41)
J M. Bland, Statistics Notes: The odds ratio BMJ. ,vol. 320, pp. 1468- 1468 ,(2000) , 10.1136/BMJ.320.7247.1468
Giuseppe Jurman, Samantha Riccadonna, Cesare Furlanello, A Comparison of MCC and CEN Error Measures in Multi-Class Prediction PLoS ONE. ,vol. 7, pp. e41882- ,(2012) , 10.1371/JOURNAL.PONE.0041882
E. B. Fowlkes, C. L. Mallows, A Method for Comparing Two Hierarchical Clusterings Journal of the American Statistical Association. ,vol. 78, pp. 553- 569 ,(1983) , 10.1080/01621459.1983.10478008
F. Boas, DETERMINATION OF THE COEFFICIENT OF CORRELATION. Science. ,vol. 29, pp. 823- 824 ,(1909) , 10.1126/SCIENCE.29.751.823
Markku Nurminen, To use or not to use the odds ratio in epidemiologic analyses European Journal of Epidemiology. ,vol. 11, pp. 365- 371 ,(1995) , 10.1007/BF01721219
Afina S. Glas, Jeroen G. Lijmer, Martin H. Prins, Gouke J. Bonsel, Patrick M.M. Bossuyt, The diagnostic odds ratio: a single indicator of test performance Journal of Clinical Epidemiology. ,vol. 56, pp. 1129- 1135 ,(2003) , 10.1016/S0895-4356(03)00177-X
E.A. Tsochatzis, K.S. Gurusamy, S. Ntaoula, E. Cholongitas, B.R. Davidson, A.K. Burroughs, Elastography for the diagnosis of severity of fibrosis in chronic liver disease: A meta-analysis of diagnostic accuracy Journal of Hepatology. ,vol. 54, pp. 650- 659 ,(2011) , 10.1016/J.JHEP.2010.07.033
Albert Orriols-Puig, Ester Bernadó-Mansilla, Evolutionary rule-based systems for imbalanced data sets soft computing. ,vol. 13, pp. 213- 225 ,(2008) , 10.1007/S00500-008-0319-7
P. Baldi, S. Brunak, Y. Chauvin, C. A. F. Andersen, H. Nielsen, Assessing the accuracy of prediction algorithms for classification: an overview Bioinformatics. ,vol. 16, pp. 412- 424 ,(2000) , 10.1093/BIOINFORMATICS/16.5.412