A Study on Comparison of Generalized Kappa Statistics in Agreement Analysis

Authors: Min-Seon Kim, Ki-Jun Song, Chung-Mo Nam, In-Kyung Jung

DOI: 10.5351/KJAS.2012.25.5.719

Abstract: Agreement analysis is conducted to assess reliability among rating results performed repeatedly on the same subjects by one or more raters. The kappa statistic is commonly used when rating scales are categorical. The simple and weighted kappa statistics measure the degree of agreement between two raters, and generalized kappa statistics measure the degree of agreement among more than two raters. In this paper, we compare the performance of four different generalized kappa statistics proposed by Fleiss (1971), Conger (1980), Randolph (2005), and Gwet (2008a). We also examine how sensitive each of the four generalized kappa statistics can be to the marginal probability distribution, as to whether marginal balancedness and/or homogeneity hold or not. The performance of the four methods is compared in terms of relative bias and coverage rate through simulation studies in various scenarios with different numbers of subjects, raters, and categories. A real data example is presented to illustrate the four methods.
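To make the comparison concrete, the following is a minimal Python sketch of three of the four coefficients named in the abstract, using their standard textbook definitions; it is not the authors' simulation code. It assumes the ratings are summarized as a count table in which `counts[i][j]` is the number of raters who assigned subject `i` to category `j`, with the same number of raters for every subject. The function name `agreement_coefficients` and the dictionary keys are illustrative choices. Conger's (1980) statistic is omitted because it needs each rater's own marginal distribution, which a count table does not retain.

```python
def agreement_coefficients(counts):
    """Chance-corrected agreement for multiple raters from a count table.

    counts[i][j] = number of raters who put subject i into category j.
    Every row must sum to the same number of raters (assumption of this sketch).
    """
    n_subjects = len(counts)
    n_categories = len(counts[0])
    n_raters = sum(counts[0])
    if n_raters < 2 or n_categories < 2:
        raise ValueError("need at least two raters and two categories")
    if any(len(row) != n_categories or sum(row) != n_raters for row in counts):
        raise ValueError("every subject must have the same raters and categories")

    # Observed agreement: share of agreeing rater pairs, averaged over subjects.
    pair_total = n_raters * (n_raters - 1)
    p_obs = sum(
        (sum(c * c for c in row) - n_raters) / pair_total for row in counts
    ) / n_subjects

    # Overall proportion of all ratings that fall in each category.
    p_cat = [
        sum(row[j] for row in counts) / (n_subjects * n_raters)
        for j in range(n_categories)
    ]

    # The coefficients differ only in how chance agreement is defined.
    chance = {
        "fleiss": sum(p * p for p in p_cat),            # Fleiss (1971)
        "randolph": 1.0 / n_categories,                 # Randolph (2005), free-marginal
        "gwet_ac1": sum(p * (1 - p) for p in p_cat)     # Gwet (2008), AC1
        / (n_categories - 1),
    }

    def corrected(p_e):
        # Undefined when chance agreement is already 1 (all ratings in one category).
        return float("nan") if p_e == 1.0 else (p_obs - p_e) / (1.0 - p_e)

    return {name: corrected(p_e) for name, p_e in chance.items()}


if __name__ == "__main__":
    # Three raters, two categories, four subjects; one category dominates.
    skewed = [[3, 0], [3, 0], [3, 0], [2, 1]]
    for name, value in agreement_coefficients(skewed).items():
        print(f"{name}: {value:.4f}")
```

On this small skewed table the observed agreement is 10/12, yet Fleiss' kappa is slightly negative (-1/11), while Randolph's coefficient is 2/3 and Gwet's AC1 is about 0.80. This sensitivity to the marginal distribution is the behavior the abstract sets out to examine.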

References (14)
A New Measure of Agreement to Resolve the Two Paradoxes of Cohen's Kappa. The Korean Journal of Applied Statistics, vol. 20, pp. 117-132 (2007). DOI: 10.5351/KJAS.2007.20.1.117
Joseph L. Fleiss, Measuring nominal scale agreement among many raters. Psychological Bulletin, vol. 76, pp. 378-382 (1971). DOI: 10.1037/H0031619
Anthony J. Conger, Integration and generalization of kappas for multiple raters. Psychological Bulletin, vol. 88, pp. 322-328 (1980). DOI: 10.1037/0033-2909.88.2.322
Kenneth J. Berry, Paul W. Mielke, A Generalization of Cohen's Kappa Agreement Measure to Interval Measurement and Multiple Raters. Educational and Psychological Measurement, vol. 48, pp. 921-933 (1988). DOI: 10.1177/0013164488484007
Alvan R. Feinstein, Domenic V. Cicchetti, High agreement but low kappa: I. The problems of two paradoxes. Journal of Clinical Epidemiology, vol. 43, pp. 543-549 (1990). DOI: 10.1016/0895-4356(90)90158-L
Kilem Li Gwet, Computing inter-rater reliability and its variance in the presence of high agreement. British Journal of Mathematical and Statistical Psychology, vol. 61, pp. 29-48 (2008). DOI: 10.1348/000711006X126600
Jacob Cohen, A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement, vol. 20, pp. 37-46 (1960). DOI: 10.1177/001316446002000104
Harald Janson, Ulf Olsson, A Measure of Agreement for Interval or Nominal Multivariate Observations. Educational and Psychological Measurement, vol. 61, pp. 277-289 (2001). DOI: 10.1177/00131640121971239