Evaluation of Procedure-Based Scoring for Hands-On Science Assessment

Authors: Gail P. Baxter, Richard J. Shavelson, Susan R. Goldman, Jerry Pine

DOI: 10.1111/J.1745-3984.1992.TB00364.X

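Abstract: This article evaluates a procedure-based scoring system for a performance assessment (an observed paper towels investigation) and a notebook surrogate completed by fifth-grade students varying in hands-on science experience. Results suggested that interrater reliability of scores for observed performance and notebooks was adequate (>.80), with the reliability of the former higher. In contrast, interrater agreement on procedures was higher for observed hands-on performance (.92) than for notebooks (.66). Moreover, for the notebooks, the reliability of scores and agreement on procedures varied by student experience, but this was not so for observed performance. Both the observed-performance and notebook measures correlated less with traditional ability than did a multiple-choice science achievement test. The correlation between the two performance assessments and the multiple-choice science achievement test was only moderate (mean = .46), suggesting that different aspects of science achievement have been measured. Finally, the correlation between the observed-performance scores and the notebook scores was .83, suggesting that notebooks may provide a reasonable, albeit less reliable, surrogate for observed hands-on performance of students.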
