Çoktan Seçmeli Testlerin Klasik Test Teorisi ve Örtük Özellikler Teorisine Göre Hesaplanan Psikometrik Özelliklerinin İki Kategorili ve Ağırlıklandırılmış Puanlaması Yönünden Karşılaştırılması

A Comparison of the Psychometric Characteristics of Multiple-Choice Tests under Binary and Weighted Scoring, Based on Classical Test Theory and Latent Trait Theory

This study examines how binary (1,0) and weighted (1,2,3,4) scoring of multiple-choice test items affect the reliability and validity of a test, under both classical test theory and latent trait theory. The data were collected by administering a multiple-choice test to 1608 students in grades 4, 5, 6, and 7 at various primary schools during the 2001-2002 school year. The findings suggest that latent trait theory combined with binary (1,0) scoring is the more suitable choice for test development. For studies carried out under classical test theory, weighted scoring can be recommended instead.
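The difference between the two scoring methods can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the answer key, the option weights, and the five response patterns are hypothetical toy values, and Cronbach's alpha stands in as a classical-test-theory reliability index because the abstract does not say which coefficient the study used.

```python
from statistics import variance


def score_binary(responses, key):
    """Score each response 1 if it matches the keyed option, else 0."""
    return [
        [1 if choice == correct else 0 for choice, correct in zip(person, key)]
        for person in responses
    ]


def score_weighted(responses, option_weights):
    """Score each response with the weight (1 to 4) of the chosen option."""
    return [
        [option_weights[i][choice] for i, choice in enumerate(person)]
        for person in responses
    ]


def cronbach_alpha(scores):
    """Cronbach's alpha for a persons-by-items score matrix."""
    k = len(scores[0])
    item_vars = [variance([person[i] for person in scores]) for i in range(k)]
    total_var = variance([sum(person) for person in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical answer key for three four-option items.
key = ["A", "C", "B"]

# Hypothetical option weights: the keyed option gets 4, distractors get 3 to 1.
option_weights = [
    {"A": 4, "B": 3, "C": 2, "D": 1},
    {"C": 4, "B": 3, "D": 2, "A": 1},
    {"B": 4, "A": 3, "C": 2, "D": 1},
]

# Hypothetical responses of five examinees (one row per person).
responses = [
    ["A", "C", "B"],
    ["A", "C", "D"],
    ["B", "C", "B"],
    ["D", "A", "D"],
    ["A", "B", "B"],
]

binary_scores = score_binary(responses, key)
weighted_scores = score_weighted(responses, option_weights)

alpha_binary = cronbach_alpha(binary_scores)
alpha_weighted = cronbach_alpha(weighted_scores)

print("alpha (binary):  ", round(alpha_binary, 3))
print("alpha (weighted):", round(alpha_weighted, 3))
```

The toy numbers say nothing about the study's actual results. They only show that weighted scoring keeps information about which distractor was chosen, whereas binary scoring collapses all wrong answers to 0.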
