Kelly D. BRADLEY, Michael PEABODY, Shannon O. SAMPSON

Quality Control in Survey Design: Evaluating a Survey of Educators’ Attitudes Concerning Differentiated Compensation

This study utilized the Rasch model to assess the quality of a survey instrument designed to measure attitudes of administrators and teachers concerning a differentiated teacher compensation program piloted in Kentucky. Researchers addressing potentially contentious issues should ensure their methods stand up to rigorous criticism. The results indicate that the rating scale does not function as expected, with items being too easy to endorse. Future iterations of this survey should be revised prior to release. Recommendations for improvement are provided.

Anahtar Kelimeler:

Rasch measurement, Survey design, Rating Scale, Teacher Compensation

Quality Control in Survey Design: Evaluating a Survey of Educators’ Attitudes Concerning Differentiated Compensation

Keywords:

Rasch measurement, Survey design, Rating Scale, Teacher Compensation,

PDF

___

Andrich D., (1978). A Rating Formulation for Ordered Response Categories. Psychometrika, 43(4), 561-573.
Becker G., (2001). Controlling Decremental and Inflationary Effects in Reliability Estimation Resulting from Violations of Assumptions. Psychological Reports, 89(2), 403-424. doi: 10.2466/pr0.2001.89.2.403
Bond T.G. & Fox C.M., (2001). Applying the Rasch Model: Fundamental Measurement in the Human Sciences. Mahwah, NJ: Lawrence Erlbaum.
Bradley K.D. & Sampson S.O., (2005). A case for using Rasch Rating Scale analysis to assess the quality of measurement in survey research. The Respondent, 12-13.
Fisher W., (1992). Reliability, Separation, Strata Statistics. Rasch Measurement Transactions, 6(3), 238.
Guttman L., (1944). A Basis for Scaling Qualitative Data. American Sociological Review, 9(2), 139-150.
Hambleton R.K., Swaminathan, H., & Rogers, H.J., (1991). Fundamentals of Item Response Theory. New York: Sage Publications.
Linacre J.M., (2002a). Optimizing Rating Scale Category Effectiveness. Journal of Applied Measurement, 3(1), 85-106.
Linacre J.M., (2002b). What do Infit and Outfit, Mean-square and Standardized mean? Rasch Measurement Transactions, 16(2), 878.
Linacre J.M., (2004). Winsteps Rasch Measurement computer program (Version 3.51). Beaverton, OR: Winsteps.com.
Rasch G., (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen, Denmark.: Danish Institute for Educational Research.
Wright B.D. (1996). Reliability and Separation. Rasch Measurement Transactions, 9(4), 472.
Wright B.D., (1997). Fundamental Measurement for Outcome Evaluation. Physical Medicine and Rehabilitation: State of the Art Reviews, 11(2), 261-288.
Wright B.D. & Masters G.N., (1982). Rating scale analysis. Chicago: MESA Press.
Wright B.D. & Stone M.H., (1979). Best Test Design. Chicago: MESA Press.

International Journal of Assessment Tools in Education-Cover

Yayın Aralığı: 4
Başlangıç: 2014
Yayıncı: İzzet KARA

Arşiv

Sayıdaki Diğer Makaleler

Fen Öğrenme Tutumları Ölçeği (FÖTÖ): Geçerlik ve Güvenirlik Çalışması

Adem BAYAR, Orhan KARAMUSTAFAOĞLU

Örgütsel Politika Algısı Ölçeğinin (POPS) Türkçe Uyarlaması: Geçerlik ve Güvenirlik Çalışması2

Evrim EROL

A Comparison of Logistic Regression Models for DIF Detection in Polytomous Items: The Effect of Small Sample Sizes and Non-Normality of Ability Distributions

Yasemin KAYA, Walter L. LEİTE, M. David MİLLER

Quality Control in Survey Design: Evaluating a Survey of Educators’ Attitudes Concerning Differentiated Compensation

Kelly D. BRADLEY, Michael PEABODY, Shannon O. SAMPSON