Eylem DENİZ, Hamparsum BOZDOGAN, Suman KATRAGADDA

Kategorik ve karma veri setlerinin yapısal eşitlik modellemesinde (YEM) Gifi yaklaşımı kullanımı ve bilgi karmaşıklığı kriteri (ICOMP)

Bu çalışmada Yapısal Eşitlik Modelleri’nde (YEM) kategorik, ikili veya karma veri setlerinin analizine ilişkin var olan problemleri çözmek için özgün bir alternatif yaklaşım olarak Gifi yöntemi önerilmiştir. Gifi yönteminde, kategorik değişkenleri nicel hale dönüştürmek için optimal ölçekleme yöntemi kullanılır. Nicelleştirme sürecinde gözlenen değişkendeki bilgi, dönüştürülmüş değişkende aynen korunur. Yani Gifi yöntemi, kategorik değişkenlerin ölçek özelliklerini bozmadan kategorik veriyi sürekli veriye dönüştürür ve bu dönüştürme işleminde herhangi bir bilgi kaybı söz konusu olmaz. Ölçek özellikleri, dönüştürülmüş doğrusal olmayan sürekli Gifi veri uzayında saklanır. Bu nedenle dönüştürme işleminden geriye dönüş mümkündür. Bu işlem, literatürde halen uygulanmakta olan rasgele belirlenmiş başlangıç değerlerini göz ardı eden Gifi sisteme özgün bir özelliktir. Gifi dönüşümünden sonra, çoklu normal dağılım varsayımına dayalı YEM kullanılarak dönüştürülmüş veri seti analiz edilmiştir. Böyle bir yaklaşım YEM’de, kategorik veriler için göz ardı edilen çok değişkenli normal dağılım varsayımını sağlamaktadır. Akaike’nin [1] Akaike Bilgi Kriteri (AIC), Bozdogan’ın [2] Tutarlı Akaike Bilgi Kriteri (CAIC) ve Bozdogan’ın [3-7] Bilgi Karmaşıklığı Kriteri (ICOMP) gibi bilgiye dayalı model seçimkriterleri YEM’de uyumun bir ölçümü olarak uygulanmaktadır. Minimum kriter değerini veren model, rakip modeller arasında veriye en iyi uyumlu model olarak seçilir. Bu çalışmada yaşam kalitesinin ölçüldüğü gerçek bir kategorik veri seti kullanılmıştır. Bu veri setine Gifi dönüşüm uygulayarak önerilen yaklaşımın çok yönlülüğü ve esnekliği gösterilmiştir. Ayrıca dönüştürülmüş veri seti üzerinden farklı YEM için model seçim kriter değerleri elde edilmiş ve minimum kriter değerini veren en iyi model belirlenmiştir.

Anahtar Kelimeler:

Yapısal Eşitlik Modelleri (YEM), Gifi Yöntemi ve Dönüşümü, Akaike Bilgi Kriteri (AIC), Tutarlı Akaike Bilgi Kriteri (CAIC), Bilgi Karmaşıklığı Kriteri (ICOMP)

Structural equation modeling (SEM) of categorical and mixed-data using the novel Gifi transformations and information complexity (ICOMP) criterion

This paper introduces and develops a novel and computationally feasible alternative approach to the analysis of categorical, dichotomous, and mixed data sets in structural equation models (SEMs) to overcome currently existing problems. Our approach is based on the Gifi system. The Gifi system uses the optimal scaling methodology to quantify the observed categorical variables. In the quantification process, information in the observed variable is retained in the quantified variable. That is, the Gifi system transforms categorical data to continuous data without destroying the scale properties of the categorical variables. The scaling is thus preserved in the transformed nonlinear continuous Gifi data space. Hence the transformation is invertible. This is one of the unique characteristics of the Gifi system which avoids the arbitrary thresholding specification that is currently practiced and used in the literature. After the Gifi transformation, we analyze the transformed data set using SEM based on the multinormal distributional assumption. Such an approach legitimizes the distributional assumption of multivariate normality in SEM. Information-theoretic model selection criteria such as Akaike’s [1] AIC, Bozdogan’s [2] Consistent AIC, called CAIC, and the information-theoretic measure of complexity ICOMP criterion of Bozdogan [3-7] are introduced and develop as measures of fit in SEMs. The model with the minimum values of the criteria is selected as the best fitting modelamong a portfolio of candidate models. We provide a real benchmark numerical example using SEM on a categorical data set which measures the quality of life (QOL) to illustrate the versatility and flexibility of our approach using the Gifi transformations on this data set and fit five alternative SEM models by scoring the model selection criteria.

Keywords: