Eren Can AYBEK

The Relation of Item Difficulty Between Classical Test Theory and Item Response Theory: Computerized Adaptive Test Perspective

This study aims to transform the calculated item difficulty statistics according to Classical Test Theory (CTT) into the item difficulty parameter of Item Response Theory (IRT) by utilizing the normal distribution curve and to analyze the effectiveness of this transformation based on Rasch model. In this regard, 36 different data sets created with catR package were studied. For each data set, item difficulty parameters and transformed item difficulty parameters were calculated and the correlation coefficients between these parameters were analyzed. Then, Computerized Adaptive Test (CAT) simulations were performed using these parameters. According to the simulation results, the correlation coefficients between the estimated theta values with both methods were high. Furthermore, in CAT simulations in which both parameters were used, especially in the samples which were over 250, it was found to have similar bias, RMSE values, and the average number of administered items.

Keywords:

Item difficulty, classical test theory, item response theory Rasch model,

PDF

___

Bock, R. D. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika, 37(1), 29-51. https://doi.org/10.1007/BF02291411
Crocker, L., & Algina, J. (1986). Introduction to Classical and Modern Test Theory. Cengage Learning.
Çelen, Ü., & Aybek, E. C. (2013). Öğrenci başarısının öğretmen yapımı bir test ile klasik test kuramı ve madde tepki kuramı yöntemleriyle elde edilen puanlara göre karşılaştırılması. Journal of Measurement and Evaluation in Education and Psychology, 4(2), 64-75. https://dergipark.org.tr/tr/pub/epod/issue/5800/77213
De Ayala, R.J. (2009). The theory and practice of item response theory. The Guilford Press.
Hambleton, R., & Swaminathan, R. (1985). Fundementals of Item Response Theory. Sage Pub.
Hambleton, R., Swaminathan, R., & Rogers, H.J. (1991). Fundementals of Item Response Theory. Sage Pub.
Kohli, N., Koran, J., & Henn, L. (2015). Relationships among classical test theory and item response theory frameworks via factor analytic models. Educational and psychological measurement, 75(3), 389–405. https://doi.org/10.1177/0013164414559071
Lord, F.M. (1980). Applications of Item Response Theory to Practical Testing Problems. Routledge.
Magis, D. & Barrada, J.R. (2017). Computerized adaptive testing with R: Recent updates of the package catR. Journal of Statistical Software, Code Snippets, 76(1), 1-19. https://doi.org/10.18637/jss.v076.c01
Magis, D. & Raiche, G. (2012). Random generation of response patterns under computerized adaptive testing with the R package catR. Journal of Statistical Software, 48(8), 1-31. https://doi.org/10.18637/jss.v048.i08
Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47(2), 149-174. https://doi.org/10.1007/BF02296272
Muraki, E. (1992). A generalized partial credit model: Application of an EM algorithm. ETS Research Report Series, 1992(1), i-30. https://doi.org/10.1002/j.2333-8504.1992.tb01436.x
Pitman, J. (1993). Probability (6th Edition). Springer.
R Core Team (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. URL https://www.R-project.org/.
Rasch, G. (1961). On general laws and the meaning of measurement in psychology. Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability (pp. 321-333). University of California Press.
Raykov, T., & Marcoulides, G. A. (2016). On the relationship between classical test theory and item response theory: From one to the other and back. Educational and psychological measurement, 76(2), 325–338. https://doi.org/10.1177/0013164415576958
Reckase, D. (2009). Multidimensional Item Response Theory. Springer.
Samejima, F. (1996). Polychotomous responses and the test score. The University of Tennessee.
van der Linden, W. J. & Glas, G.A.W. (2022). Computerized Adaptive Testing: Theory and Practice. Kluwer Academic Publishers.

Eğitimde ve Psikolojide Ölçme ve Değerlendirme Dergisi-Cover

ISSN: 1309-6575
Yayın Aralığı: 4
Başlangıç: 2010
Yayıncı: Selahattin GELBAL

Arşiv

Sayıdaki Diğer Makaleler

Investigation of Measurement Invariance of Turkish Subtest on ABIDE 2016 in Relation to Characteristics of Teachers: Sub-sampling Method

Süleyman ÜLKÜ, Burcu ATAR

The Relation of Item Difficulty Between Classical Test Theory and Item Response Theory: Computerized Adaptive Test Perspective

Eren Can AYBEK

A Comparison of Different Designs in Scoring of PISA 2009 Reading Open Ended Items According to Generalizability Theory

Meral ALKAN, Nuri DOĞAN

Examination of Differential Item Functioning in PISA 2018 Mathematics Literacy Test with Different Methods

Emre KUCAM, Hamide Deniz GÜLLEROĞLU