Diversity of decision-making models and the measurement of interrater agreement.

Author: John S. Uebersax

Abstract: Several papers have appeared criticizing the kappa coefficient because of its tendency to fluctuate with sample base rates. The importance of these criticisms is difficult to evaluate because they are presented with regards to a highly specific model of diagnostic decision making. In this article, diagnostic decision making is viewed as a special case of signal detection theory. Each diagnostic process is characterized by a function that relates the probability of a case receiving a positive diagnosis to the severity or salience of the case's symptoms. The shape of this diagnosability curve greatly affects the value of kappa obtained in a study of interrater reliability, how it changes in response to variation in base rates, and how closely it corresponds to the validity of diagnostic decisions. The common practice of evaluating a diagnostic procedure, when criterion diagnoses for comparison are unavailable, on the basis of the magnitude of an observed kappa coefficient of interrater reliability is questionable. New methods of measuring interrater agreement are necessary, and possible directions for research in this area are discussed.
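
The base-rate sensitivity mentioned in the abstract can be illustrated with a minimal sketch. It is not the article's signal detection model: it assumes the simpler textbook setup of two conditionally independent raters who share one fixed sensitivity and specificity. The function name `kappa_from_base_rate` and the values 0.9 are hypothetical choices made for illustration only.

```python
def kappa_from_base_rate(base_rate, sensitivity, specificity):
    """Cohen's kappa for two raters under an illustrative assumption:
    both raters have the same sensitivity and specificity and err
    independently given the true status of the case."""
    p, se, sp = base_rate, sensitivity, specificity
    # Joint probabilities that both raters agree on a diagnosis.
    both_pos = p * se * se + (1 - p) * (1 - sp) * (1 - sp)
    both_neg = p * (1 - se) * (1 - se) + (1 - p) * sp * sp
    observed = both_pos + both_neg
    # Marginal probability that a single rater gives a positive diagnosis.
    pos = p * se + (1 - p) * (1 - sp)
    # Agreement expected by chance from the marginals alone.
    expected = pos * pos + (1 - pos) * (1 - pos)
    return (observed - expected) / (1 - expected)


# Rater accuracy is held fixed; only the base rate changes.
for rate in (0.05, 0.20, 0.50):
    print(rate, round(kappa_from_base_rate(rate, 0.9, 0.9), 3))
```

Under these assumptions the observed agreement is identical at every base rate, yet kappa falls as the base rate moves away from 0.5, because the chance-agreement term grows.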

References (17)

John Arthur Swets, Ronald M. Pickett. Evaluation of diagnostic systems: methods from signal detection theory. Academic Press (1982).

Lee B. Lusted. Introduction to medical decision making. Charles C. Thomas (1968).

Edward L. Spitznagel. A Proposed Solution to the Base Rate Problem in the Kappa Statistic. Archives of General Psychiatry, vol. 42, pp. 725-728 (1985). doi:10.1001/ARCHPSYC.1985.01790300093012

Charles E. Metz. Basic principles of ROC analysis. Seminars in Nuclear Medicine, vol. 8, pp. 283-298 (1978). doi:10.1016/S0001-2998(78)80014-2

Joseph L. Fleiss. Measuring nominal scale agreement among many raters. Psychological Bulletin, vol. 76, pp. 378-382 (1971). doi:10.1037/H0031619

Martin A. Tanner, Michael A. Young. Modeling Agreement among Raters. Journal of the American Statistical Association, vol. 80, pp. 175-180 (1985). doi:10.1080/01621459.1985.10477157

Kenneth Kaye. Estimating False Alarms and Missed Events From Interobserver Agreement: A Rationale. Psychological Bulletin, vol. 88, pp. 458-468 (1980). doi:10.1037//0033-2909.88.2.458

John A. Swets. Indices of discrimination or diagnostic accuracy: their ROCs and implied models. Psychological Bulletin, vol. 99, pp. 100-117 (1986). doi:10.1037/0033-2909.99.1.100

Cynthia L. Janes. An extension of the Random Error Coefficient of Agreement to N x N tables. British Journal of Psychiatry, vol. 134, pp. 617-619 (1979). doi:10.1192/BJP.134.6.617

Jacob Cohen. A Coefficient of agreement for nominal Scales. Educational and Psychological Measurement, vol. 20, pp. 37-46 (1960). doi:10.1177/001316446002000104
