A multivariate distance‐based analytic framework for microbial interdependence association test in longitudinal study

Genetic Epidemiology - Tập 41 Số 8 - Trang 769-778 - 2017
Yilong Zhang1, Sung Won Han2, Laura M. Cox3, Huilin Li4,5
1Merck Research Laboratories, Rahway, New Jersey, United States of America
2Fusion Data Analytics Lab, School of Industrial Management Engineering, Korea University, Seoul, South Korea
3Department of Neurology, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
4Department of Environmental Medicine, NYU Langone Medical Center, New York, NY, United States of America
5Department of Population Health (Biostatistics), NYU Langone Medical Center, New York, NY, United States of America

Tóm tắt

ABSTRACTHuman microbiome is the collection of microbes living in and on the various parts of our body. The microbes living on our body in nature do not live alone. They act as integrated microbial community with massive competing and cooperating and contribute to our human health in a very important way. Most current analyses focus on examining microbial differences at a single time point, which do not adequately capture the dynamic nature of the microbiome data. With the advent of high‐throughput sequencing and analytical tools, we are able to probe the interdependent relationship among microbial species through longitudinal study. Here, we propose a multivariate distance‐based test to evaluate the association between key phenotypic variables and microbial interdependence utilizing the repeatedly measured microbiome data. Extensive simulations were performed to evaluate the validity and efficiency of the proposed method. We also demonstrate the utility of the proposed test using a well‐designed longitudinal murine experiment and a longitudinal human study. The proposed methodology has been implemented in the freely distributed open‐source R package and Python code.

Từ khóa


Tài liệu tham khảo

Aitchison J., 2003, The statistical analysis of compositional data

10.1111/j.1442-9993.2001.01070.pp.x

10.1126/scitranslmed.aad7121

10.1186/s13059-016-0980-6

10.1128/IAI.05496-11

Cario M. C. &Nelson B. L.(1997).Modeling and generating random vectors with arbitrary marginal distributions and correlation matrix. Technical Report Department of Industrial Engineering and Management Sciences. Evanston IL: Northwestern University.

10.1093/bioinformatics/btw308

10.1038/nrg3182

10.1038/nature11400

10.1016/j.cell.2014.05.052

10.1371/journal.pcbi.1002606

10.1371/journal.pcbi.1002687

10.1093/biomet/ass070

10.1371/journal.pone.0009085

10.1016/j.cell.2012.10.052

10.1890/0012-9658(2001)082[0290:FMMTCD]2.0.CO;2

10.1038/ncomms8486

10.1098/rspl.1896.0076

10.1128/mBio.01135-14

10.4161/psb.6.1.14191

10.1111/j.1541-0420.2009.01300.x

10.1126/science.1205438

10.1093/bioinformatics/btw311

10.1038/ismej.2014.147

10.1038/nature07540

10.1038/nature05414

10.1038/ismej.2016.37

10.1007/s00284-010-9582-9

10.1093/bioinformatics/bts668