A new approach to test score equating using item response theory with fixed C-parameters

Asia Pacific Education Review - Tập 9 - Trang 248-261 - 2008
Guemin Lee1, Anne R. Fitzpatrick1,2
1Department of Education, Yonsei University, Seodaemoon-Ku, Korea
2Educational Testing Services, USA

Tóm tắt

Because parameter estimates from different calibration runs under the IRT model are linearly related, a linear equation can convert IRT parameter estimates onto another scale metric without changing the probability of a correct response (Kolen & Brennan, 1995, 2004). This study was designed to explore a new approach to finding a linear equation by fixing C-parameters for anchor items in IRT equating. A rationale for fixing C-parameters for anchor items in IRT equating can be established from the fact that the C-parameters are not affected by any linear transformation. This new approach can avoid the difficulty in getting accurate C-parameters for anchor items embedded in the application of the IRT model. Based upon our findings in this study, we would recommend using the new approach to fix C-parameters for anchor items in IRT equating.

Tài liệu tham khảo

Baker, F. B., & Al-Karni, A. (1991). A comparison of two procedures for computing IRT equating coefficients.Journal of Educational Measurement, 28, 147–162. Bock, R. D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm.Psychometrika, 46, 443–459. Bucket, G. R. (1996).PARDUXMX [Computer program]. Unpublished. Bucket, G. R. (2000).PARDUXMJ [Computer program]. Unpublished. Cook, L. L., & Eignor, D. R. (1991). An NCME instructional module on IRT equating methods.Educational Measurement: Issues and Practices, 10, 37–45. Hambleton, R. K. (1989). Principles and selected applications of item response theory. In R.L. Linn (Ed.),Educational measurement (3rd ed.). Phoenix, AZ: Oryx Press. Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991).Fundamentals of item response theory. Newbury Park, CA: Sage. Hanson, B. A., & Beguin, A. A. (2002). Obtaining a common scale for item response theory item parameters using separate versus concurrent estimation in the common-item equating design.Applied Psychological Measurement, 26, 3–24. Hung, P., Wu, Y., & Chen, Y. (1991).IRT item parameter linking: Relevant issues for the purpose of item banking. Paper presented at the International Academic Symposium on Psychological Measurement, Tainan, Taiwan. Kim, S.-H., & Cohen, A. S. (1992). Effects of linking methods on detection of DIF.Journal of Educational Measurement, 29, 51–66. Kolen, M. J., & Brennan, R. L. (1995).Test equating: Methods and practices. New York: Springer-Verlag. Kolen, M. J., & Brennan, R. L. (2004).Test equating, scaling, and linking: Methods and practices (2nd ed.). New York: Springer. Lord, F. M. (1980).Applications of item response theory to practical testing problems._Hillsdale, NJ: Erlbaum. Loyd, B. H., & Hoover, H. D. (1980). Vertical equating using the Rasch model.Journal of Educational Measurement, 17, 179–193. Marco, G. L. (1977). Item characteristic curve solutions to three intractable testing problems.Journal of Educational Measurement, 14, 139–160. Ogasawara, H. (2001). Least squares estimation of item response theory linking coefficients.Applied Psychological Measurement, 25, 3–24. Stocking, M. L., & Lord, F. M. (1983). Developing a common metric in item response theory.Applied Psychological Measurement, 7, 201–210. Thissen, D. M., & Wainer, H. (1982). Some standard errors in item response theory.Psychometrika, 47, 397–412. Way, W. D., & Tang, K. L. (1991).A comparison of four logistic model equating methods. Paper presented at the annual meeting of the American Educational Research Association, Chicago, IL. Wingersky, M. S., Barton, M. A., & Lord, F. M. (1982).LOGIST users guide. Princeton, NJ: Educational Testing Service. Yen, W. (1990).CTB scaling specifications. Unpublished research paper.