A non-parametric approach for detecting gene-gene interactions associated with age-at-onset outcomes

BMC Genetics - Volume 15 - Pages 1-11 - 2014
Ming Li1, Joseph C Gardiner2, Naomi Breslau2, James C Anthony2, Qing Lu2
1Division of Biostatistics, Department of Pediatrics, University of Arkansas for Medical Sciences, Little Rock, USA
2Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, USA

Abstract

Cox-regression-based methods are commonly used to analyze survival outcomes such as age at disease onset. These methods generally assume that the hazard functions are proportional across risk groups. However, this assumption may not hold in genetic association studies, especially when complex interactions are involved. In addition, genetic association studies commonly adopt case-control designs, and applying Cox regression directly to case-control data may yield biased estimators and incorrect statistical inference. We propose a non-parametric approach, the weighted Nelson-Aalen (WNA) approach, for detecting genetic variants associated with age-dependent outcomes. The proposed approach can be applied directly to prospective cohort studies and can be easily extended to population-based case-control studies. Moreover, it does not rely on any assumptions about the disease inheritance model and is able to capture high-order gene-gene interactions. Through simulations, we show that the proposed approach outperforms Cox-regression-based methods in various scenarios. We also conduct an empirical study of the progression of nicotine dependence (ND) by applying the WNA approach to three independent datasets from the Study of Addiction: Genetics and Environment. In the initial dataset, two SNPs, rs6570989 and rs2930357, located in genes GRIK2 and CSMD1, are found to be significantly associated with the progression of ND. The joint association is further replicated in two independent datasets. Further analysis suggests that these two genes may interact and be associated with the progression of ND. As demonstrated by the simulation studies and real data analysis, the proposed approach provides an efficient tool for detecting genetic interactions associated with age-at-onset outcomes.
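For readers unfamiliar with the estimator the approach is named after, the following is a minimal sketch of the standard Nelson-Aalen cumulative hazard estimate, H(t) = sum over event times t_i <= t of d_i / n_i, where d_i is the number of events at t_i and n_i is the number still at risk just before t_i. The function name `nelson_aalen` and the optional `weights` argument are assumptions introduced here for illustration. The weights show one generic way per-subject weights could enter the estimate (for example, to reflect sampling in a case-control design); they are not the weighting scheme or test statistic defined in the paper, which the abstract does not specify.

```python
from collections import defaultdict


def nelson_aalen(times, events, weights=None):
    """Return [(t, H(t))] at each distinct time with at least one event.

    times   : observed time for each subject (event or censoring time)
    events  : 1/True if the event was observed, 0/False if censored
    weights : optional per-subject weights (hypothetical illustration only,
              not the paper's WNA weights); defaults to 1.0 for everyone
    """
    n = len(times)
    if weights is None:
        weights = [1.0] * n
    if len(events) != n or len(weights) != n:
        raise ValueError("times, events and weights must have equal length")

    event_weight = defaultdict(float)  # weighted number of events at each time
    exit_weight = defaultdict(float)   # weighted number leaving the risk set
    for t, e, w in zip(times, events, weights):
        exit_weight[t] += w
        if e:
            event_weight[t] += w

    at_risk = float(sum(weights))      # weighted size of the initial risk set
    cumulative = 0.0
    curve = []
    for t in sorted(exit_weight):
        d = event_weight.get(t, 0.0)
        if d > 0:
            cumulative += d / at_risk  # hazard increment d_i / n_i
            curve.append((t, cumulative))
        at_risk -= exit_weight[t]      # events and censored subjects both leave
    return curve


if __name__ == "__main__":
    # Five subjects; the ones observed at ages 2 (second entry) and 4 are censored.
    ages = [1, 2, 2, 3, 4]
    onset = [1, 1, 0, 1, 0]
    for age, hazard in nelson_aalen(ages, onset):
        print(age, round(hazard, 3))
```

In a genetic setting, one would compute such curves separately for each genotype group (or multi-locus genotype combination) and compare them. How the groups are formed and tested in the WNA approach is described in the full paper, not in this sketch.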
