Generalized Estimating Equations for Genetic Association Studies of Multi-Correlated Longitudinal Family Data
Generalized Estimating Equations for Genetic Association Studies of Multi-Correlated Longitudinal Family Data
In genetic epidemiology studies, many diseases are multifactorial that can be bothenvironmental and genetic inherited pattern. The relationship between genetic variability andindividual phenotypes is usually investigated by genetic association studies. In geneticassociation studies, longitudinal measures are very important scale in detecting disease variants.They enable to observe both factors in the progress of disease. Generalized Linear Modelling(GLM) techniques offer a flexible approach for testing and quantifying genetic associationsconsidering different types of phenotype distributions. In this study, it is aimed to accommodateGeneralized Estimating Equations (GEE) method for genetic association studies in the presenceof both familial and serial correlation. For this purpose, a real genotyped data set with thepedigree information and a continuous trait measured over time is used to model the associationbetween the disease and the genotype by analyzing several variants, which have been associatedwith the disease. A joint working correlation structure is adapted, accounting for two differentsources of correlations for estimating equations.
___
- Shults, J., Whitt, M.C., Kumanyika, S., “Analysis of data with multiple sources of correlation in the
framework of generalize estimating equations”, Statistics in Medicine, 23(20): 3209- 3226, (2004).
- Liang, K.Y., Zeger, S.L., “Longitudinal data analysis using generalized linear models”, Biometrika,
73: 13-22, (1986).
- Hubbard, A.E., Ahern, J., Fleischer, N.L., Van der Laan, M., Lippman, S.A., Jewell, N., Bruckner,
T., Satariano, W.A., “To GEE or Not to GEE: Comparing population average and mixed models for
estimating the associations between neighbourhood risk factors and health”, Epidemiology, 21(4):
467- 474, (2010).
- Brown, H., Prescott, R., Applied Mixed Models in Medicine. Statistics in Practice, Wiley Series,
England, (2006).
- Qaqish, B. F. and Liang K. Y., “Marginal models for correlated binary responses with multiple
classes and multiple levels of nesting”, Biometrics, 48, 939-950,(1992).
- Ten Have, T.R., Kunselman, A.R., Tran, L., “A comparison of mixed effects logistic regression
models for binary response data with two nested levels of clustering”, Statistics in Medicine, 18(8):
947-960, (1999).
- Joy, J. and Lin X., “Latent Variable Models for Longitudinal Data with Multiple Continuous
Outcomes”, Biometrics, 56(4):1047-1054, (2000).
- Lefkopoulou, M. Moore, D., Ryan, L. “ The Analysis of Multiple Correlated Binary Outcomes:
Application to Rodent Teratolgy Experiments”, Journal of the American Statistical Association,
84(407): 810-815, (1989).
- Levy, D., Ehret, G.B., Rice, K., Verwoert, G.C., Launer, L.J., Dehghan, A., Glazer, N.L., Morrison,
A.C., Johnson, A.D., Aspelund, T., Aulchenko, Y., Lumley, T., Köttgen, A., Vasan, R.S.,
Rivadeneira, F., Eiriksdottir, G., Guo, X., Arking, D.E., Mitchelli,.G.F., Mattace-Raso, F.U., Smith,
A.V., Taylor, K., Scharpf, R.B., Hwang, S.J., Sijbrands, E.J., Bis, J., Harris, T.B., Ganesh, S.K.,
O'Donnell, C.J., Hofman, A., Rotter, J.I., Coresh, J., Benjamin, E.J., Uitterlinden, A.G., Heiss, G.,
Fox, C.S., Witteman, J.C., Boerwinkle, E., Wang, T.J., Gudnason, V., Larson, M.G., Chakravarti, A.,
Psaty, B.M., van Duijn, C.M., “Genome-wide association study of blood pressure and hypertension”,
Nat. Genet., 41(6): 77–687, (2009).
- Almasy, L., Dyer, T.D., Peralta, J.M., Jun, G., Wood, A.R., Fuchsberger, C., Almeida, M.A., Kent,
Jr S.W., Fowler, S., Blackwell, T.W., Puppala, S., Kumar, S., Curran, J.E., Lehman, D., Abecasis,
G., Duggirala, R., Blangero, J., The T2D-GENES Consortium, “Data for genetic analysis workshop
18: human whole genome sequence, blood pressure and simulated phenotypes in extended
pedigrees”, BMC Proc., 8 (suppl. 2):S2, (2014).
- Barsh, G.S., Copenhaver, G.P., Gibson, G., Williams, S.M., “Guidelines for genome-wide
association studies”, PloS Genetics, 8, 7, (2012).