Fast UniFrac: facilitating high-throughput phylogenetic analyses of microbial communities including analysis of pyrosequencing and PhyloChip data

ISME Journal - Tập 4 Số 1 - Trang 17-27 - 2010
Micah Hamady1, Catherine Lozupone2,3, Rob Knight3,4
1Department of Computer Science, University of Colorado, Boulder, CO, USA
2Center for Genome Sciences, Washington University School of Medicine , St Louis, MO , USA
3Department of Chemistry and Biochemistry, University of Colorado, Boulder, CO, USA
4Howard Hughes Medical Institute, Chevy Chase, MD, USA.

Tóm tắt

Abstract Next-generation sequencing techniques, and PhyloChip, have made simultaneous phylogenetic analyses of hundreds of microbial communities possible. Insight into community structure has been limited by the inability to integrate and visualize such vast datasets. Fast UniFrac overcomes these issues, allowing integration of larger numbers of sequences and samples into a single analysis. Its new array-based implementation offers orders of magnitude improvements over the original version. New 3D visualization of principal coordinates analysis results, with the option to view multiple coordinate axes simultaneously, provides a powerful way to quickly identify patterns that relate vast numbers of microbial communities. We show the potential of Fast UniFrac using examples from three data types: Sanger-sequencing studies of diverse free-living and animal-associated bacterial assemblages and from the gut of obese humans as they diet, pyrosequencing data integrated from studies of the human hand and gut, and PhyloChip data from a study of citrus pathogens. We show that a Fast UniFrac analysis using a reference tree recaptures patterns that could not be detected without considering phylogenetic relationships and that Fast UniFrac, coupled with BLAST-based sequence assignment, can be used to quickly analyze pyrosequencing runs containing hundreds of thousands of sequences, showing patterns relating human and gut samples. Finally, we show that the application of Fast UniFrac to PhyloChip data could identify well-defined subcategories associated with infection. Together, these case studies point the way toward a broad range of applications and show some of the new features of Fast UniFrac.

Từ khóa


Tài liệu tham khảo

Alexander, 2009, Microbial eukaryotes in the hypersaline anoxic L′Atalante deep-sea basin, Environ Microbiol, 11, 360, 10.1111/j.1462-2920.2008.01777.x

Altschul, 1990, Basic local alignment search tool, J Mol Biol, 215, 403, 10.1016/S0022-2836(05)80360-2

Balakirev, 2008, DNA variation and symbiotic associations in phenotypically diverse sea urchin Strongylocentrotus intermedius, Proc Natl Acad Sci USA, 105, 16218, 10.1073/pnas.0807860105

Bryant, 2008, Colloquium paper: microbes on mountainsides: contrasting elevational patterns of bacterial and plant diversity, Proc Natl Acad Sci USA, 105, 11505, 10.1073/pnas.0801920105

DeSantis, 2007, High-density universal 16S rRNA microarray analysis reveals broader diversity than typical clone library when sampling the environment, Microb Ecol, 53, 371, 10.1007/s00248-006-9134-9

DeSantis, 2006, Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB, Appl Environ Microbiol, 72, 5069, 10.1128/AEM.03006-05

Desnues, 2008, Biodiversity and biogeography of phages in modern stromatolites and thrombolites, Nature, 452, 340, 10.1038/nature06735

Elifantz, 2008, Diversity and abundance of glycosyl hydrolase family 5 in the North Atlantic Ocean, FEMS Microbiol Ecol, 63, 316, 10.1111/j.1574-6941.2007.00429.x

Fierer, 2008, The influence of sex, handedness, and washing on the diversity of hand surface bacteria, Proc Natl Acad Sci USA, 105, 17994, 10.1073/pnas.0807920105

Frank, 2007, Molecular-phylogenetic characterization of microbial community imbalances in human inflammatory bowel diseases, Proc Natl Acad Sci USA, 104, 13780, 10.1073/pnas.0706625104

Fraune, 2007, Long-term maintenance of species-specific bacterial microbiota in the basal metazoan Hydra, Proc Natl Acad Sci USA, 104, 13146, 10.1073/pnas.0703375104

Graham, 2008, Phylogenetic beta diversity: linking ecological and evolutionary processes across space in time, Ecol Lett, 11, 1265, 10.1111/j.1461-0248.2008.01256.x

Grice, 2008, A diversity profile of the human skin microbiota, Genome Res, 18, 1043, 10.1101/gr.075549.107

Hamady, 2008, Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplex, Nat Methods, 5, 235, 10.1038/nmeth.1184

Harrison, 2009, Variations in archaeal and bacterial diversity associated with the sulfate-methane transition zone in continental margin sediments (Santa Barbara Basin, California), Appl Environ Microbiol, 75, 1487, 10.1128/AEM.01812-08

Hartman, 2008, Environmental and anthropogenic controls over bacterial communities in wetland soils, Proc Natl Acad Sci USA, 105, 17842, 10.1073/pnas.0808254105

Hiibel, 2008, Microbial community analysis of two field-scale sulfate-reducing bioreactors treating mine drainage, Environ Microbiol, 10, 2087, 10.1111/j.1462-2920.2008.01630.x

Hsu, 2009, Evidence for the functional significance of diazotroph community structure in soil, ISME J, 3, 124, 10.1038/ismej.2008.82

Huber, 2007, Microbial population structures in the deep marine biosphere, Science, 318, 97, 10.1126/science.1146689

Kanagawa, 2003, Bias and artifacts in multitemplate polymerase chain reactions (PCR), J Biosci Bioeng, 96, 317, 10.1016/S1389-1723(03)90130-7

Knight, 2007, PyCogent: a toolkit for making sense from sequence, Genome Biol, 8, R171, 10.1186/gb-2007-8-8-r171

Lauber, 2009, Laccase gene composition and relative abundance in oak forest soil is not affected by short-term nitrogen fertilization, Microb Ecol, 57, 50, 10.1007/s00248-008-9437-0

Ley, 2008, Evolution of mammals and their gut microbes, Science, 320, 1647, 10.1126/science.1155725

Ley, 2008, Worlds within worlds: evolution of the vertebrate gut microbiota, Nat Rev Microbiol, 6, 776, 10.1038/nrmicro1978

Ley, 2006, Microbial ecology: human gut microbes associated with obesity, Nature, 444, 1022, 10.1038/4441022a

Li, 2008, Symbiotic gut microbes modulate human metabolic phenotypes, Proc Natl Acad Sci USA, 105, 2117, 10.1073/pnas.0712038105

Lozupone, 2006, UniFrac—an online tool for comparing microbial community diversity in a phylogenetic context, BMC Bioinformatics, 7, 371, 10.1186/1471-2105-7-371

Lozupone, 2005, UniFrac: a new phylogenetic method for comparing microbial communities, Appl Environ Microbiol, 71, 8228, 10.1128/AEM.71.12.8228-8235.2005

Lozupone, 2008, The convergence of carbohydrate active gene repertoires in human gut microbes, Proc Natl Acad Sci USA, 105, 15076, 10.1073/pnas.0807339105

Lozupone, 2007, Global patterns in bacterial diversity, Proc Natl Acad Sci USA, 104, 11436, 10.1073/pnas.0611525104

Lozupone, 2008, Species divergence and the measurement of microbial diversity, FEMS Microbiol Rev, 32, 557, 10.1111/j.1574-6976.2008.00111.x

Ludwig, 2004, ARB: a software environment for sequence data, Nucleic Acids Res, 32, 1363, 10.1093/nar/gkh293

Marhaver, 2008, Viral communities associated with healthy and bleaching corals, Environ Microbiol, 10, 2277, 10.1111/j.1462-2920.2008.01652.x

Martin, 2002, Phylogenetic approaches for describing and comparing the diversity of microbial communities, Appl Environ Microbiol, 68, 3673, 10.1128/AEM.68.8.3673-3682.2002

Nasidze, 2009, Global diversity in the human salivary microbiome, Genome Res, 19, 636, 10.1101/gr.084616.108

Osman, 2008, Microbial burden and diversity of commercial airline cabin air during short and long durations of travel, ISME J, 2, 482, 10.1038/ismej.2008.11

Porter, 2008, Fruiting body and soil rDNA sampling detects complementary assemblage of Agaricomycotina (Basidiomycota, Fungi) in a hemlock-dominated forest plot in southern Ontario, Mol Ecol, 17, 3037, 10.1111/j.1365-294X.2008.03813.x

Rawls, 2006, Reciprocal gut microbiota transplants from zebrafish and mice to germ-free recipients reveal host habitat selection, Cell, 127, 423, 10.1016/j.cell.2006.08.043

Roesch, 2007, Pyrosequencing enumerates and contrasts soil microbial diversity, ISME J, 1, 283, 10.1038/ismej.2007.53

Sagaram, 2009, Bacterial diversity analysis of Huanglongbing pathogen-infected citrus, using PhyloChip arrays and 16S rRNA gene clone library sequencing, Appl Environ Microbiol, 75, 1566, 10.1128/AEM.02404-08

Sogin, 2006, Microbial diversity in the deep sea and the underexplored ‘rare biosphere’, Proc Natl Acad Sci USA, 103, 12115, 10.1073/pnas.0605127103

Turnbaugh, 2009, A core gut microbiome in obese and lean twins, Nature, 457, 480, 10.1038/nature07540

Turnbaugh, 2007, The human microbiome project, Nature, 449, 804, 10.1038/nature06244

Turnbaugh, 2006, An obesity-associated gut microbiome with increased capacity for energy harvest, Nature, 444, 1027, 10.1038/nature05414

Wen, 2008, Innate immunity and intestinal microbiota in the development of Type 1 diabetes, Nature, 455, 1109, 10.1038/nature07336

Widmann, 2006, DivergentSet, a tool for picking non-redundant sequences from large sequence collections, Mol Cell Proteomics, 5, 1520, 10.1074/mcp.T600022-MCP200

Wilson, 2002, High-density microarray of small-subunit ribosomal DNA probes, Appl Environ Microbiol, 68, 2535, 10.1128/AEM.68.5.2535-2541.2002

Zhu, 2006, Automatic dimensionality selection from the scree plot via the use of profile likelihood, Comput Stat Data Anal, 51, 918, 10.1016/j.csda.2005.09.010