Inequalities between multi-rater kappas

作者: Matthijs J. Warrens

DOI: 10.1007/S11634-010-0073-4

关键词: CombinatoricsKappaMathematicsPairwise comparisonFleiss' kappaCauchy–Schwarz inequalityMeasure (mathematics)Cohen's kappaCalculusUpper and lower bounds

摘要: The paper presents inequalities between four descriptive statistics that have been used to measure the nominal agreement two or more raters. Each of is a function pairwise information. Light's kappa and Hubert's are multi-rater versions Cohen's kappa. Fleiss' extension Scott's pi, whereas Randolph's generalizes Bennett et al. S multiple While consistent ordering numerical values these measures has frequently observed in practice, there thus far no theoretical proof general inequality among measures. It proved lower bound kappa, an upper if all tables weakly marginal symmetric raters assign certain minimum proportion objects specified category.

参考文章(42)
Mark Davies, Joseph L. Fleiss, Measuring Agreement for Multinomial Data Biometrics. ,vol. 38, pp. 1047- ,(1982) , 10.2307/2529886
Joseph L. Fleiss, Measuring nominal scale agreement among many raters. Psychological Bulletin. ,vol. 76, pp. 378- 382 ,(1971) , 10.1037/H0031619
Rebecca Zwick, Another look at interrater agreement. Psychological Bulletin. ,vol. 103, pp. 374- 378 ,(1988) , 10.1037/0033-2909.103.3.374
Frances P O'Malley, Syed K Mohsin, Sunil Badve, Shikha Bose, Laura C Collins, Marguerite Ennis, Celina G Kleer, Sarah E Pinder, Stuart J Schnitt, None, Interobserver reproducibility in the diagnosis of flat epithelial atypia of the breast Modern Pathology. ,vol. 19, pp. 172- 179 ,(2006) , 10.1038/MODPATHOL.3800514
Anthony J. Conger, Integration and generalization of kappas for multiple raters. Psychological Bulletin. ,vol. 88, pp. 322- 328 ,(1980) , 10.1037/0033-2909.88.2.322
Kenneth J. Berry, Paul W. Mielke, A Generalization of Cohen's Kappa Agreement Measure to Interval Measurement and Multiple Raters Educational and Psychological Measurement. ,vol. 48, pp. 921- 933 ,(1988) , 10.1177/0013164488484007
H. J. A. Schouten, Measuring Pairwise Agreement Among Many Observers. II. Some Improvements and Additions Biometrical Journal. ,vol. 24, pp. 431- 435 ,(1982) , 10.1002/BIMJ.4710240502
Richard J. Light, Measures of response agreement for qualitative data: Some generalizations and alternatives. Psychological Bulletin. ,vol. 76, pp. 365- 377 ,(1971) , 10.1037/H0031643
Matthijs J. Warrens, Inequalities Between Kappa and Kappa-Like Statistics for k×k Tables Psychometrika. ,vol. 75, pp. 176- 185 ,(2010) , 10.1007/S11336-009-9138-8