Diversity of decision-making models and the measurement of interrater agreement.

作者: John S. Uebersax

DOI: 10.1037/0033-2909.101.1.140

关键词:

摘要: Several papers have appeared criticizing the kappa coefficient because of its tendency to fluctuate with sample base rates. The importance these criticisms is difficult evaluate they are presented regards a highly specific model diagnostic decision making. In this article, making viewed as special case signal detection theory. Each process characterized by function that relates probability receiving positive diagnosis severity or salience symptoms. shape diagnosability curve greatly affects value obtained in study interrater reliability, how it changes response variation rates, and closely corresponds validity decisions. common practice evaluating procedure, when criterion diagnoses for comparison unavailable, on basis magnitude observed reliability questionable. New methods measuring agreement necessary, possible directions research area discussed.

参考文章(17)
John Arthur Swets, Ronald M. Pickett, Evaluation of diagnostic systems : methods from signal detection theory Academic Press. ,(1982)
Lee B. Lusted, Introduction to medical decision making Charles C. Thomas. ,(1968)
Edward L. Spitznagel, A Proposed Solution to the Base Rate Problem in the Kappa Statistic Archives of General Psychiatry. ,vol. 42, pp. 725- 728 ,(1985) , 10.1001/ARCHPSYC.1985.01790300093012
Charles E. Metz, Basic principles of ROC analysis Seminars in Nuclear Medicine. ,vol. 8, pp. 283- 298 ,(1978) , 10.1016/S0001-2998(78)80014-2
Joseph L. Fleiss, Measuring nominal scale agreement among many raters. Psychological Bulletin. ,vol. 76, pp. 378- 382 ,(1971) , 10.1037/H0031619
Martin A. Tanner, Michael A. Young, Modeling Agreement among Raters Journal of the American Statistical Association. ,vol. 80, pp. 175- 180 ,(1985) , 10.1080/01621459.1985.10477157
Kenneth Kaye, None, Estimating False Alarms and Missed Events From Interobserver Agreement: A Rationale Psychological Bulletin. ,vol. 88, pp. 458- 468 ,(1980) , 10.1037//0033-2909.88.2.458
John A. Swets, Indices of discrimination or diagnostic accuracy: their ROCs and implied models. Psychological Bulletin. ,vol. 99, pp. 100- 117 ,(1986) , 10.1037/0033-2909.99.1.100
Cynthia L. Janes, An extension of the Random Error Coefficient of Agreement to N x N tables. British Journal of Psychiatry. ,vol. 134, pp. 617- 619 ,(1979) , 10.1192/BJP.134.6.617
Jacob Cohen, A Coefficient of agreement for nominal Scales Educational and Psychological Measurement. ,vol. 20, pp. 37- 46 ,(1960) , 10.1177/001316446002000104