Small Sample Kernel Association Tests for Human Genetic and Microbiome Association Studies

Genetic Epidemiology - Tập 40 Số 1 - Trang 5-19 - 2016
Jun Chen1,2, Wenan Chen1,2, Ni Zhao3, Michael C. Wu3, Daniel J. Schaid1
1Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, United States of America
2This author is the co-first author.
3Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America

Tóm tắt

AbstractKernel machine based association tests (KAT) have been increasingly used in testing the association between an outcome and a set of biological measurements due to its power to combine multiple weak signals of complex relationship with the outcome through the specification of a relevant kernel. Human genetic and microbiome association studies are two important applications of KAT. However, the classic KAT framework relies on large sample theory, and conservativeness has been observed for small sample studies, especially for microbiome association studies. The common approach for addressing the small sample problem relies on computationally intensive resampling methods. Here, we derive an exact test for KAT with continuous traits, which resolve the small sample conservatism of KAT without the need for resampling. The exact test has significantly improved power to detect association for microbiome studies. For binary traits, we propose a similar approximate test, and we show that the approximate test is very powerful for a wide range of kernels including common variant‐ and microbiome‐based kernels, and the approximate test controls the type I error well for these kernels. In contrast, the sequence kernel association tests have slightly inflated genomic inflation factors after small sample adjustment. Extensive simulations and application to a real microbiome association study are used to demonstrate the utility of our method.

Từ khóa


Tài liệu tham khảo

10.1038/nmeth.f.303

10.1371/journal.pone.0015216

10.1007/978-1-4614-7846-1_16

10.1093/bioinformatics/bts342

10.1038/nrg3182

10.2307/2346911

10.1016/j.csda.2009.11.025

10.1038/ng.2344

10.1016/j.ajhg.2013.04.015

10.1016/j.ajhg.2007.10.010

10.1016/j.ajhg.2012.06.007

10.1111/j.1541-0420.2007.00799.x

10.1186/1471-2105-9-292

10.1038/nrg2344

McCullaghP NelderJA.1989.Generalized Linear Models. Chapman and Hall/CRC London.

10.1007/s10852-005-9022-1

10.1101/gr.3709305

10.1002/gepi.21877

10.1126/science.1208344

10.1016/j.ajhg.2010.05.002

10.1016/j.ajhg.2011.05.029

10.1016/j.ajhg.2015.04.003