Aynı Tepki Kategorilerine Sahip Likert Maddelerin Psikometrik Özelliklerinin Çok Kategorili Madde Tepki Kuramı Modelleri ile İncelenmesi

Investigation of Psychometric Properties of Likert Items with the Same Response Categories Using Polytomous Item Response Theory Models

The purpose of this study was to investigate within- and between-threshold parameter invariance for items of afourteen-item Positive Affect Scale developed to assess positive moods (like happy, peaceful, etc.) of universitystudents. To test whether the estimated threshold parameters were as expected (1 to 5, with increments of 1)across all the 14 items, Graded Response, Partial Credit, and Rating Scale Models were fit the response datacollected from 326 students. A comparison of the model fit statistics, such as the negative 2log likelihood andchi-square values, revealed that the Graded Response Model had the best fit and that the thresholds estimates forall the items in the Positive Affective Scale were reasonably close to the expected 1 to 5 values with incrementsof 1. The study illustrates how polytomous response models can be used to test the psychometric quality of itemswith ordinal rating scales.

___

  • Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43(4), 561-573. doi: 10.1007/BF02293814
  • Baker, F. B. (2001). The basis of item response theory. USA: ERIC Clearing house on Assessment and Evaluation.
  • Baker, J. G., Rounds, J. B., & Zevon, M. A. (2000). A comparison of graded response and Rasch partial credit models with subjective well-being. Journal of Educational and Behavioral Statistics, 25(3), 253-270. doi: 10.3102/10769986025003253
  • Brown, T. A. (2015). Confirmatory factor analysis for applied research. New York, NY: Guilford.
  • Caycho-Rodríguez, T., Vilca, L. W., Carbajal-León, C., White, M., Vivanco-Vidal, A., Saroli-Araníbar, D., ..., Moreta-Herrera, R. (2021). Coronavirus anxiety scale: New psychometric evidence for the Spanish version based on CFA and IRT models in a Peruvian sample. Death Studies. doi: 10.1080/07481187.2020.1865480
  • Chalmers, R. P. (2012). mirt: A multidimensional item response theory package for the R environment. Journal of Statistical Software, 48(6), 1-29. doi: 10.18637/jss.v048.i06
  • Chernyshenko, O. S., Stark, S., Chan, K. Y., Drasgow, F., & Williams, B. (2001). Fitting item response theory models to two personality inventories: issues and insights. Multivariate Behavioral Research, 36(4), 523-562. doi: 10.1207/S15327906MBR3604_03
  • Cho, S., Drasgow, F., & Cao, M. (2015). An investigation of emotional intelligence measures using item response theory. Psychological Assessment, 27(4), 1241-1252. doi: 10.1037/pas0000132
  • de Ayala, R. J., Dodd, B. G., & Koch, W. R. (1990, April). A comparison of the partial credit and graded response model in computerized adaptive testing. Paper presented at the AERA Annual Meeting. Boston.
  • DeMars, C. (2010). Item response theory: Understanding statistics measurement. Oxford: Oxford University Press.
  • Demirtaşlı, N., Yalçın, S., & Ayan, C. (2016). The development of irt based attitude scale towards educational measurement course. Eğitimde ve Psikolojide Ölçme ve Değerlendirme Dergisi, 1(7), 133-144. doi: 10.21031/epod.43804
  • Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. New Jersey: LEA publishers.
  • Ferrando, P., Lorenzo, U., & Molina, G. (2001). An item response theory analysis of response stability in personality measurement. Applied Psychological Measurement, 25(1), 3-17. doi: 10.1177/01466216010251001
  • Flannery, W. P., Reise, S. P., & Widaman, K. F. (1995). An item response theory analysis of the general and academic scales of the self-description questionnaire II. Journal of Research in Personality, 29(2), 168- 188. doi: 10.1006/jrpe.1995.1010
  • Glass, G. V., & Hopkins, K. D. (1984). Statistical methods in education and psychology. Englewood Cliffs, NJ: Prentice Hall.
  • Gray-Little, B., Williams, V., & Hancock, T. (1997). An item response theory analysis of the Rosenberg self - esteem scale. Personality and Social Psychology Bulletin, 23(5), 443-451. doi: 10.1177/0146167297235001
  • Hambleton, R. K., & Swaminathan, H. (1985). Item response theory: Principles and applications. New York: Springer Science and Business Media.
  • Hattie, J. (1992). Self-concept. Hillsdale, NJ: Erlbaum.
  • Hu, L. T., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling, 6(1), 1-55. doi: 10.1080/10705519909540118
  • Kahraman, N., Akbaş, D., & Sözer, E. (2019). Bilişsel-olmayan öğrenme durum ve süreçlerini ölçme ve değerlendirmede boylamsal yaklaşımlar: Duygu Cetveli Alan Uygulaması örneği. Bolu Abant İzzet Baysal Üniversitesi Eğitim Fakültesi Dergisi, 19(1), 257-269. doi: 10.17240/aibuefd.2019.19.43815- 459831
  • Kaptan, S. (1995). Bilimsel araştırma ve istatistik teknikleri (10. Basım). Ankara: Rehber Yayınevi.
  • Koch, W. R. (1983). Likert scaling using the graded response latent trait model. Applied Psychological Measurement, 7(1), 15-32. doi: 10.1177/014662168300700104
  • Köse, İ. A. (2015). Aşamalı tepki modeli ve klasik test kuramı altında elde edilen test ve madde parametrelerinin karşılaştırılması. Abant İzzet Baysal Üniversitesi Eğitim Fakültesi Dergisi, 15(2), 184-197. Retrieved from https://dergipark.org.tr/tr/download/article-file/17439
  • Linacre, J. M. (2002). Optimizing rating scale category effectiveness. Journal of Applied Measurement, 3(1), 85-106. Retrieved from http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.424.2811&rep=rep1&type=pdf
  • Lord, F. M. (1980). Applications of item response theory practical testing problems. Hillsdale, NJ: Erlbaum.
  • Messick, S. (1995). Validity of psychological assessment: Validation of inferences from persons’ responses and performances as scientific inquiry into score meaning. American Psychologist, 50, 741-749. doi: 10.1002/j.2333-8504.1994.tb01618.x
  • Min, S., & Aryadoust, V. (2021). A systematic review of item response theory in language assessment: implications for the dimensionality of language ability. Studies in Educational Evaluation, 68. doi: 10.1016/j.stueduc.2020.100963
  • Muraki, E. (1992). A generalized partial credit model: Application of an EM algorithm. Applied Psychological Measurement, 16(2), 159-176. doi: 10.1177/014662169201600206
  • R Core Team. (2016). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from https://www.R-project.org/
  • Roskam, E. E. (1985). Current issues in item response theory. In E. E. Roskam (Ed.), Measurement and personality assessment (pp. 3-19). Amsterdam: North Holland.
  • Rubio, V. J., Aguado, D., Hontangas, P. M., & Hernandez, J. M. (2007). Psychometric properties of an emotional adjustment measure: an application of the Graded Response Model. European Journal of Psychological Assessment, 23(1), 39-46. doi: 10.1027/1015-5759.23.1.39
  • Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores (Psychometric Monography No. 17). Retrieved from https://www.psychometricsociety.org/sites/main/files/fileattachments/mn17.pdf?1576606975
  • Samejima, F. (1996). Evaluation of mathematical models for ordered polychotomous responses. Behaviormetrika, 23, 17-35. doi: 10.2333/bhmk.23.17
  • Silvia, P. J. (2021). The self-reflection and insight scale: applying item response theory to craft an efficient short form. Current Psychology. doi: 10.1007/s12144-020-01299-7
  • Steiger, J. H., & Lind, J. M. (1980, May). Statistically based tests for the number of common factors. Paper presented in Psychometric Society. Iowa City.
  • Wang, W. C., Wilson, M., & Shih, C. L. (2006). Modeling randomness in judging rating scales with a randomeffects rating scale model. Journal of Educational Measurement, 43(4), 335-353. doi: 10.1111/j.1745- 3984.2006.00020.x
  • Watson, D., Clark, L. A., & Tellegen, A. (1988). Development and validation of brief measures of positive and negative affect: The PANAS scales. Journal of Personality and Social Psychology, 54(6), 1063-1070. doi: 10.1037/0022-3514.54.6.1063
  • Wu, M., & Adams, R. (2006). Modelling mathematics problem solving item responses using a multidimensional IRT model. Mathematics Education Research Journal, 18(2), 93-113. doi: 10.1007/BF03217438
  • Yaşar, M., & Aybek, E. C. (2019). Üniversite öğrencileri için bir yılmazlık ölçeğinin geliştirilmesi: Madde tepki kuramı temelinde geçerlilik ve güvenilirlik çalışması. İlköğretim Online, 18(4), 1687-1699. Retrieved from https://ilkogretim-online.org/fulltext/218-1597121020.pdf?1618815938
  • Yen, W. M. (1984). Effects of local item dependence on the fit and equating performance of the three-parameter logistic model. Applied Psychological Measurement, 8(2), 125-145. doi: 10.1177/014662168400800201