Likelihood-Based Item-Fit Indices for Dichotomous Item Response Theory Models

Applied Psychological Measurement - Volume 24, Issue 1 - Pages 50-64 - 2000

Maria Orlando¹, David Thissen²

¹Rand Corp, Santa Monica, CA, US

²University of North Carolina at Chapel Hill

Abstract

New goodness-of-fit indices are introduced for dichotomous item response theory (IRT) models. These indices are based on the likelihoods of number-correct scores derived from the IRT model, and they provide a direct comparison of the modeled and observed frequencies for correct and incorrect responses for each number-correct score. The behavior of Pearson’s X² (S-X²) and the likelihood ratio G² (S-G²) was assessed in a simulation study and compared with two fit indices similar to those currently in use (Q₁-X² and Q₁-G²). The simulations included three conditions in which the simulating and fitting models were identical and three conditions involving model misspecification. S-X² performed well, with Type I error rates close to the expected .05 and .01 levels. Performance of this index improved with increased test length. S-G² tended to reject the null hypothesis too often, as did Q₁-X² and Q₁-G². The power of S-X² appeared to be similar for all test lengths, but varied depending on the type of model misspecification.
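
The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: it assumes a two-parameter logistic model, a standard normal ability distribution approximated on an evenly spaced grid, and the usual recursive (Lord-Wingersky style) computation of number-correct score likelihoods. All function names and parameter values are hypothetical, and any pooling of sparse score groups that a real analysis would need is omitted.

```python
import math

def p_correct(theta, a, b):
    """2PL probability of a correct response (model assumed for this sketch)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def score_likelihoods(probs):
    """Recursively build the likelihood of each number-correct score
    (0..len(probs)) at one ability value from per-item probabilities."""
    like = [1.0]
    for p in probs:
        new = [0.0] * (len(like) + 1)
        for k, v in enumerate(like):
            new[k] += v * (1.0 - p)   # this item answered incorrectly
            new[k + 1] += v * p       # this item answered correctly
        like = new
    return like

def expected_proportions(items, item_index, grid):
    """Model-implied proportion correct on one item for scores 1..n-1."""
    n = len(items)
    num = [0.0] * (n + 1)
    den = [0.0] * (n + 1)
    for theta in grid:
        # Unnormalized N(0, 1) weight; the constant cancels in the ratio.
        w = math.exp(-0.5 * theta * theta)
        probs = [p_correct(theta, a, b) for a, b in items]
        full = score_likelihoods(probs)
        rest = score_likelihoods(probs[:item_index] + probs[item_index + 1:])
        for k in range(1, n):
            # Item correct and k - 1 correct among the remaining items.
            num[k] += probs[item_index] * rest[k - 1] * w
            den[k] += full[k] * w
    return {k: num[k] / den[k] for k in range(1, n)}

def s_x2(items, item_index, responses, grid):
    """Pearson-type statistic comparing observed and modeled proportions
    correct within each number-correct score group (no cell pooling)."""
    n = len(items)
    expected = expected_proportions(items, item_index, grid)
    counts = [0] * (n + 1)
    correct = [0] * (n + 1)
    for row in responses:
        k = sum(row)
        counts[k] += 1
        correct[k] += row[item_index]
    stat = 0.0
    for k in range(1, n):
        if counts[k] == 0:
            continue  # no examinees observed at this score
        o = correct[k] / counts[k]
        e = expected[k]
        stat += counts[k] * (o - e) ** 2 / (e * (1.0 - e))
    return stat

# Hypothetical item parameters (a, b) and a tiny made-up response matrix.
items = [(1.0, -1.0), (1.5, 0.0), (0.8, 1.0)]
grid = [-4 + 0.1 * j for j in range(81)]
responses = [[1, 0, 0], [1, 1, 0], [0, 1, 0], [1, 1, 1], [0, 0, 0], [1, 0, 1]]
print(s_x2(items, 0, responses, grid))
```

Scores of 0 and n are skipped because every examinee in those groups necessarily answered the item incorrectly or correctly, so they carry no information about fit. A useful sanity check on the expected proportions is that, summed over items, they equal the score k itself.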
