Application of kappa statistics in sequential tests for family-based design

Application of kappa statistics in sequential tests for family-based design

Family-based designs are commonly used in genetic association studies to locate markers associated with diseases. It is a challenging task to collect a large enough sample size, perform a statistical test, and obtain the desired statistical power. The sequential probability ratio test (SPRT) was introduced to overcome the limited sample size problem. However, the drawback of SPRT is that, for the sake of accuracy, the test leaves many markers in a gray zone meaning no decision . In this article, we propose a novel approach: a sequential probability ratio test plus (SPRT+) to reduce the number of these gray zone markers. Using simulated data, the results of SPRT+ are compared with the results of SPRT. SPRT+ shows a promising overall performance in identifying highly and moderately associated markers in the correct association region without a loss of accuracy

___

  • [1] Johnson AD, O’Donnell CJ. An open access database of genome-wide association results. BMC Med Genet 2009; 10: 6.
  • [2] Ku CS, Loy EY, Pawitan Y, Chia KS. The pursuit of genome-wide association studies: where are we now? J Hum Genet 2010; 55: 195-206
  • [3] Bush WS, Moore JH. Genome-wide association studies. PLoS Comput Biol 2012; 8: 12-18.
  • [4] Laird NM, Lange C. The role of family-based designs in genome-wide association studies. Stat Sci 2009; 24: 388-397.
  • [5] Falk CT, Rubinstein P. Haplotype relative risks: an easy reliable way to construct a proper control sample for risk calculations. Ann Hum Genet 1987; 51: 227-233.
  • [6] Ott J. Statistical properties of the haplotype relative risk. Genet Epidemiol 1989; 6: 127-130.
  • [7] Spielman RS, McGinnis RE, Ewens WJ. Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). Am J Hum Genet 1993; 52: 506-516.
  • [8] Lasky-Su J, Won S, Mick E, Anney RJL, Franke B, Neale B, Biederman J, Smalley SL, Loo SK, Todorov A et al. On genome-wide association studies for family-based designs: an ıntegrative analysis approach combining ascertained family samples with unselected controls. Am J Hum Genet 2010; 86: 573-580.
  • [9] Ott J, Kamatani Y, Lathrop M. Family-based designs for genome-wide association studies. Nat Rev Genet 2011; 12: 465-474.
  • [10] Laird NM, Lange C. Family-based methods for linkage and association analysis. Adv Genet 2008; 60: 219-252.
  • [11] Spielman RS, Ewens WJ. The TDT and other family-based tests for linkage disequilibrium and association. Am J Hum Genet 1996; 59: 983-989.
  • [12] Wang D, Sun F. Sample sizes for the transmission disequilibrium tests: TDT, S-TDT and 1-TDT. Commun Stat Theory 2000; 29: 1129-1142.
  • [13] Ilk O, Rajabli F, Dungul-Ciglidag D, Ozdag H, Ilk HG. A novel approach for small sample size family-based association studies: sequential tests. Eur J Hum Genet 2011; 19: 915-920.
  • [14] Anderson TW. A modification of the sequential probability ratio test to reduce the sample size. Ann Math Stat 1959; 34: 165-197.
  • [15] Wald A. Sequential Analysis. New York, NY, USA: John Wiley & Sons, 1947.
  • [16] Spigelhalter D, Grigg O, Kinsman R, Treasure T. Risk-adjusted sequential probability ratio test: application to Bristol, Shipman and adult cardiac surgery. Int J Qual Health Care 2003; 15: 7-13.
  • [17] Viera AJ, Garrett JM. Understanding ınterobserver agreement: the kappa statistic. Fam Med 2005; 37: 360-363.
  • [18] Huang DW, Sherman BT, Tan Q, Collins JR, Alvord WG, Roayaei J, Stephens R, Baseler MW, Lane HC, Lempicki RA. The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists. Genome Biol 2007; 8: R183