Likelihood-Based Item-Fit Indices for Dichotomous Item Response Theory Models

Applied Psychological Measurement - Tập 24 Số 1 - Trang 50-64 - 2000
Maria Orlando1, David Thissen2
1Rand Corp, Santa Monica, CA, US
2University of North Carolina at Chapel Hill

Tóm tắt

New goodness-of-fit indices are introduced for dichotomous item response theory (IRT) models. These indices are based on the likelihoods of number-correct scores derived from the IRT model, and they provide a direct comparison of the modeled and observed frequencies for correct and incorrect responses for each number-correct score. The behavior of Pearson’s X2 ( S- X2) and the likelihood ratio G2 ( S- G2) was assessed in a simulation study and compared with two fit indices similar to those currently in use ( Q1- X2 and Q1- G2). The simulations included three conditions in which the simulating and fitting models were identical and three conditions involving model misspecification. S- X2 performed well, with Type I error rates close to the expected .05 and .01 levels. Performance of this index improved with increased test length. S- G2 tended to reject the null hypothesis too often, as did Q1- X2 and Q1- G2. The power of S- X2 appeared to be similar for all test lengths, but varied depending on the type of model misspecification.

Từ khóa


Tài liệu tham khảo

10.1007/BF02291180

Ankenmann, R. (1994). Goodness of fit and ability estimation in the graded response model. Unpublished manuscript.

10.1007/BF02291411

10.1007/BF02293801

10.1007/BF02291262

Chen, W. (1995). Estimation of item parameters for the three-parameter logistic model using the marginal likelihood of summed scores (Doctoral dissertation, University of North Carolina, 1995). Dissertation Abstracts International, 56/10-B, 5825.

10.1214/aoms/1177729380

10.1007/BF02294405

10.1007/978-94-017-1988-9

10.1177/014662168500900306

10.1080/01621459.1978.10481567

Lord, F. M., 1980, Applications of item response theory to practical testing problems

10.1177/014662168400800409

10.1177/014662168500900105

Mislevy, R. J., 1986, Bilog: Item analysis and test scoring with binary logistic models

Orlando, M. (1997). Item fit in the context of item response theory. (Doctoral dissertation, University of North Carolina, 1997). Dissertation Abstracts International, 58/04-B, 2175.

Rasch, G., 1960, Probabilistic models for some intelligence and attainment tests

10.1007/978-1-4612-4578-0

10.1177/014662168701100103

10.1080/01621459.1971.10482341

10.1177/014662169401800206

10.1007/978-1-4612-6390-6

Thissen, D., 1991, MULTILOG user’s guide: Multiple categorical item analysis and test scoring using item response theory

10.1177/014662169501900105

Wainer, H., 1990, Computerized adaptive testing: A primer, 65

Wright, B., 1977, BICAL: Calibrating items and scales with the Rasch model

10.1177/001316446902900102

10.1177/014662168100500212