Confirmatory factor analysis and item response theory: Two approaches for exploring measurement invariance

Authors: Steven P. Reise, Keith F. Widaman, Robin H. Pugh

DOI: 10.1037/0033-2909.114.3.552

Keywords: Social psychology, Measurement invariance, Trait, Psychological testing, Item response theory, Test (assessment), Cognitive psychology, Confirmatory factor analysis, Psychometrics, Comparability, Psychology

Abstract: This study investigated the utility of confirmatory factor analysis (CFA) and item response theory (IRT) models for testing the comparability of psychological measurements. Both procedures were used to investigate whether mood ratings collected in Minnesota and China were comparable. Several issues were addressed. The first issue was that of establishing a common measurement scale across groups, which involves full or partial measurement invariance of trait indicators. It is shown that, using CFA or IRT models, test items that function differentially as trait indicators across groups need not interfere with comparing examinees on the same trait dimension. Second, the issue of model fit was addressed. It is proposed that person-fit statistics be used to judge the practical fit of IRT models. Finally, topics for future research are suggested. Much debate has been motivated by the question of how to establish that a test measures the same trait dimension, in the same way, when administered to two or more qualitatively distinct groups (e.g., men and women). The question can also be posed as follows: Are test scores for individuals who belong to different examinee populations comparable on the same measurement scale? The objectives of this review are to investigate linear confirmatory factor analysis (CFA; Long, 1983) and item response theory (IRT; Lord, 1980) approaches to addressing this important issue and to suggest, by way of a real-data application, the advantages and disadvantages of each approach.

References (69)
Fritz Drasgow, Michael V. Levine, Mary E. McLaughlin, Appropriateness Measurement for Some Multidimensional Test Batteries. Applied Psychological Measurement, vol. 15, pp. 171-191 (1991), 10.1177/014662169101500207
David Thissen, Lynne Steinberg, Data analysis using item response theory. Psychological Bulletin, vol. 104, pp. 385-395 (1988), 10.1037/0033-2909.104.3.385
Peter M. Bentler, EQS: Structural equations program manual. BMDP Statistical Software (1989)
Benjamin D. Wright, Solving measurement problems with the Rasch model. Journal of Educational Measurement, vol. 14, pp. 97-116 (1977), 10.1111/J.1745-3984.1977.TB00031.X
Cecil R. Reynolds, Richard E. Harding, Outcome in Two Large Sample Studies of Factorial Similarity under Six Methods of Comparison. Educational and Psychological Measurement, vol. 43, pp. 723-728 (1983), 10.1177/001316448304300305
Gregory L. Candell, Fritz Drasgow, An Iterative Procedure for Linking Metrics and Assessing Item Bias in Item Response Theory. Applied Psychological Measurement, vol. 12, pp. 253-260 (1988), 10.1177/014662168801200304
Roderick P. McDonald, Herbert W. Marsh, Choosing a multivariate model: Noncentrality and goodness of fit. Psychological Bulletin, vol. 107, pp. 247-255 (1990), 10.1037/0033-2909.107.2.247
David Thissen, Lynne Steinberg, Meg Gerrard, Beyond group-mean differences: The concept of item bias. Psychological Bulletin, vol. 99, pp. 118-128 (1986), 10.1037/0033-2909.99.1.118
William R. Koch, Likert Scaling Using the Graded Response Latent Trait Model. Applied Psychological Measurement, vol. 7, pp. 15-32 (1983), 10.1177/014662168300700104
Robert L. McKinley, Craig N. Mills, A comparison of several goodness-of-fit statistics. Applied Psychological Measurement, vol. 9, pp. 49-57 (1985), 10.1177/014662168500900105