The Effects of Referent Item Parameters on Differential Item Functioning Detection Using the Free Baseline Likelihood Ratio Test

Applied Psychological Measurement - Volume 33, Issue 4 - Pages 251-265 - 2009
Gabriel E. Lopez Rivas (1), Stephen Stark (1), Oleksandr S. Chernyshenko (2)
(1) University of South Florida
(2) Nanyang Business School, Nanyang Technological University

Abstract

The purpose of this simulation study is to investigate the effects of anchor subtest composition on the accuracy of item response theory (IRT) likelihood ratio (LR) differential item functioning (DIF) detection (Thissen, Steinberg, & Wainer, 1988). Here, the IRT LR test was implemented with a free baseline approach wherein a baseline model was formed by freeing all items except a referent or anchor subset and examining the changes in fit with respect to a series of models wherein 1 item at a time was constrained in addition to the referent(s). The results clearly indicated that the composition of the anchor subtest is important for accurate DIF detection. It was found that using a single highly discriminating rather than a low discriminating referent greatly enhanced the power of the procedure. Moreover, in conditions involving small DIF or smaller sample sizes or both, power appeared to improve when a group of highly discriminating referents was used. These findings have implications for applied research involving short scales and small sample sizes.
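The comparison step of the free baseline approach described above can be sketched as follows. For each studied item, the fit of the model that constrains that item (in addition to the referents) is compared with the fit of the baseline model through the statistic G2 = -2 * (lnL_constrained - lnL_baseline), which is referred to a chi-square distribution with degrees of freedom equal to the number of parameters constrained. This is a minimal illustration, not the authors' implementation: the function name, the item labels, the log-likelihood values, and the choice of df = 2 (as for one item under a two-parameter model) are all assumptions made for the example, and the log-likelihoods would in practice come from IRT software.

```python
# Upper 5% critical values of the chi-square distribution, keyed by df.
CHI2_CRIT_05 = {1: 3.841, 2: 5.991, 3: 7.815}


def free_baseline_lr(baseline_loglik, constrained_logliks, df):
    """Compare each constrained model with the free baseline model.

    baseline_loglik: log-likelihood of the baseline model, in which only
        the referent item(s) are constrained equal across groups.
    constrained_logliks: maps each studied item to the log-likelihood of
        the model that additionally constrains that item.
    df: number of item parameters constrained per studied item.

    Returns {item: (G2, flagged)}, where flagged is True when G2 exceeds
    the .05 critical value, i.e. the item is flagged as showing DIF.
    """
    crit = CHI2_CRIT_05[df]
    results = {}
    for item, loglik in constrained_logliks.items():
        # Constraining an item can only lower (or leave unchanged) the fit,
        # so G2 is non-negative up to estimation error.
        g2 = -2.0 * (loglik - baseline_loglik)
        results[item] = (g2, g2 > crit)
    return results


# Hypothetical log-likelihoods, invented purely for illustration.
baseline = -5120.40
constrained = {"item_2": -5121.10, "item_3": -5126.85, "item_4": -5120.95}

results = free_baseline_lr(baseline, constrained, df=2)
for item, (g2, flagged) in results.items():
    print(f"{item}: G2 = {g2:.2f}, DIF flagged = {flagged}")
```

With these invented values, only item_3 produces a drop in fit large enough to be flagged. The sketch takes the baseline log-likelihood as given; the study's point is that the quality of that baseline depends on which referent items are chosen.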

References

American Psychological Association, 1999, Standards for educational and psychological testing

Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee's ability. In F. M. Lord & M. R. Novick (Eds.), Statistical theories of mental test scores (pp. 397-472). Reading, MA: Addison-Wesley.

10.1207/S15324818AME1502_01

Camilli, G., 1994, Methods for identifying biased test items

10.1177/014662168801200304

10.1207/S15327906MBR3604_03

10.1177/014920639902500101

Costa, P.T., Jr., 1989, The NEO-PI/NEO-FFI manual supplement

10.1037/0021-9010.69.3.498

10.1177/0146621605275728

10.1177/014662169602000201

Holland, P.W., 1993, Differential item functioning

Kim, S.-H., 1995, Applied Psychological Measurement, 8, 291

Lord, F.M., 1980, Applications of item response theory to practical testing problems

10.1348/096317902320369703

Samejima, F., 1969, Psychometrika Monograph Supplement, 34, 100

10.1177/106939719502900302

Smith, P.C., 1969, The measurement of satisfaction in work and retirement

10.1111/j.2044-8317.1974.tb00543.x

Stark, S., 1999, SGRGEN: A computer program for polytomous data generation [Computer program]

Stark, S., 2000, 3PLGEN: A computer program for dichotomous data generation [Computer program]

10.1037/0021-9010.89.3.497

10.1037/0021-9010.91.6.1292

Thissen, D., 1991, MULTILOG user's guide (Version 6) [Computer software]

Thissen, D., Steinberg, L., & Wainer, H. (1988). Use of item response theory in the study of group differences in trace lines. In H. Wainer & H. I. Braun (Eds.), Test validity (pp. 147-169). Hillsdale, NJ: Erlbaum.

10.1037/1040-3590.6.3.212

10.1177/0146621603259902

10.3200/JEXE.72.3.221-261