Assessing Agreement Among Raters And Identifying Atypical Raters Using A Log-Linear Modeling Approach

作者: Kari B. Kastango

DOI:

关键词:

摘要: When an outcome is rated by several raters, ensuring consistency across raters increases the reliability of measurement. Tanner and Young (1985) proposed a general class log-linear models to assess agreement among K rating scale with C nominal categories. Their methodology can be used pair-wise three or more raters. Rogel et al. (1996, 1998) extended this work assessing various patterns rater sub-groups size K-1. These test assumption exchangeability. Although parameters from these identify atypical no formal inferential procedures are available. I propose approach that exchangeability rater. The global heterogeneous partial model fit data comparisons made, adjusting p-values for multiple made. parameter constantly involved in statistically significant distinguished. premise that, if there rater, at least one will differ remaining K-1 parameters. illustrated using published intestinal biopsy study six (Rogel al., 1998). Overall Type error power correctly assessed via simulation 5. Bonferroni, Sidak, Holm's Step-down Bonferroni Sidak adjustments control overall error. Being able present, improving ratings directly, influence measurement given sample size. Consequently, informative studies conducted interventions (e.g., behavioral, medicinal) may have positive impact on public's health.

参考文章(23)
Joseph L. Fleiss, Jacob Cohen, B. S. Everitt, Large sample standard errors of kappa and weighted kappa. Psychological Bulletin. ,vol. 72, pp. 323- 327 ,(1969) , 10.1037/H0028106
Martin A. Tanner, Michael A. Young, Modeling Agreement among Raters Journal of the American Statistical Association. ,vol. 80, pp. 175- 180 ,(1985) , 10.1080/01621459.1985.10477157
A. Rogel, P. Y. Boëlle, J. Y. Mary, Global and partial agreement among several observers. Statistics in Medicine. ,vol. 17, pp. 489- 501 ,(1998) , 10.1002/(SICI)1097-0258(19980228)17:4<489::AID-SIM751>3.0.CO;2-9
Surekha Mudivarthy, M Bhaskara Rao, Model Selection and Inference Technometrics. ,vol. 42, pp. 319- 319 ,(2000) , 10.1080/00401706.2000.10486070
Y. Hochberg, A. C. Tamhane, Multiple Comparison Procedures ,(1987)
A Theodossi, D J Spiegelhalter, J Jass, J Firth, M Dixon, M Leader, D A Levison, R Lindley, I Filipe, A Price, Observer variation and discriminatory value of biopsy features in inflammatory bowel disease. Gut. ,vol. 35, pp. 961- 968 ,(1994) , 10.1136/GUT.35.7.961
Juliet Popper Shaffer, Modified Sequentially Rejective Multiple Test Procedures Journal of the American Statistical Association. ,vol. 81, pp. 826- 831 ,(1986) , 10.1080/01621459.1986.10478341
Zbyněk Šidák, Rectangular Confidence Regions for the Means of Multivariate Normal Distributions Journal of the American Statistical Association. ,vol. 62, pp. 626- 633 ,(1967) , 10.1080/01621459.1967.10482935