Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes

International Journal of Systematic and Evolutionary Microbiology - Tập 64 Số Pt_2 - Trang 346-351 - 2014
Mincheol Kim1, Hyun‐Seok Oh2, Sang‐Cheol Park2, Jongsik Chun2,1
1School of Biological Sciences, Seoul National University, Seoul 151-742, Republic of Korea
2Interdisciplinary Program in Bioinformatics and Bioinformatics Institute, Seoul National University, Seoul 151-742, Republic of Korea

Tóm tắt

Among available genome relatedness indices, average nucleotide identity (ANI) is one of the most robust measurements of genomic relatedness between strains, and has great potential in the taxonomy of bacteria and archaea as a substitute for the labour-intensive DNA–DNA hybridization (DDH) technique. An ANI threshold range (95–96 %) for species demarcation had previously been suggested based on comparative investigation between DDH and ANI values, albeit with rather limited datasets. Furthermore, its generality was not tested on all lineages of prokaryotes. Here, we investigated the overall distribution of ANI values generated by pairwise comparison of 6787 genomes of prokaryotes belonging to 22 phyla to see whether the suggested range can be applied to all species. There was an apparent distinction in the overall ANI distribution between intra- and interspecies relationships at around 95–96 % ANI. We went on to determine which level of 16S rRNA gene sequence similarity corresponds to the currently accepted ANI threshold for species demarcation using over one million comparisons. A twofold cross-validation statistical test revealed that 98.65 % 16S rRNA gene sequence similarity can be used as the threshold for differentiating two species, which is consistent with previous suggestions (98.2–99.0 %) derived from comparative studies between DDH and 16S rRNA gene sequence similarity. Our findings should be useful in accelerating the use of genomic sequence data in the taxonomy of bacteria and archaea.

Từ khóa


Tài liệu tham khảo

Acinas, 2004, Divergence and redundancy of 16S rRNA sequences in genomes with multiple rrn operons, J Bacteriol, 186, 2629, 10.1128/JB.186.9.2629-2635.2004

Ash, 1991, Comparative analysis of Bacillus anthracis, Bacillus cereus, and related species on the basis of reverse transcriptase sequencing of 16S rRNA, Int J Syst Bacteriol, 41, 343, 10.1099/00207713-41-3-343

Chan, 2012, Defining bacterial species in the genomic era: insights from the genus Acinetobacter, BMC Microbiol, 12, 302, 10.1186/1471-2180-12-302

Chun, 2007, EzTaxon: a web-based tool for the identification of prokaryotes based on 16S ribosomal RNA gene sequences, Int J Syst Evol Microbiol, 57, 2259, 10.1099/ijs.0.64915-0

Cui, 2009, Intraspecific polymorphism of 16S rRNA genes in two halophilic archaeal genera, Haloarcula and Halomicrobium, Extremophiles, 13, 31, 10.1007/s00792-008-0194-2

Deloger, 2009, A genomic distance based on MUM indicates discontinuity between most bacterial species and genera, J Bacteriol, 191, 91, 10.1128/JB.01202-08

Devereux, 1990, Diversity and origin of Desulfovibrio species: phylogenetic definition of a family, J Bacteriol, 172, 3609, 10.1128/JB.172.7.3609-3619.1990

Gevers, 2005, Opinion: re-evaluating prokaryotic species, Nat Rev Microbiol, 3, 733, 10.1038/nrmicro1236

Goris, 2007, DNA–DNA hybridization values and their relationship to whole-genome sequence similarities, Int J Syst Evol Microbiol, 57, 81, 10.1099/ijs.0.64483-0

Grim, 2013, Pan-genome analysis of the emerging foodborne pathogen Cronobacter spp. suggests a species-level bidirectional divergence driven by niche adaptation, BMC Genomics, 14, 366, 10.1186/1471-2164-14-366

Haley, 2010, Comparative genomic analysis reveals evidence of two novel Vibrio species closely related to V. cholerae, BMC Microbiol, 10, 154, 10.1186/1471-2180-10-154

Henz, 2005, Whole-genome prokaryotic phylogeny, Bioinformatics, 21, 2329, 10.1093/bioinformatics/bth324

Keswani, 2001, Relationship of 16S rRNA sequence similarity to DNA hybridization in prokaryotes, Int J Syst Evol Microbiol, 51, 667, 10.1099/00207713-51-2-667

Kim, 2012, Introducing EzTaxon-e: a prokaryotic 16S rRNA gene sequence database with phylotypes that represent uncultured species, Int J Syst Evol Microbiol, 62, 716, 10.1099/ijs.0.038075-0

Konstantinidis, 2005, Genomic insights that advance the species definition for prokaryotes, Proc Natl Acad Sci U S A, 102, 2567, 10.1073/pnas.0409727102

Konstantinidis, 2006, The bacterial species definition in the genomic era, Philos Trans R Soc Lond B Biol Sci, 361, 1929, 10.1098/rstb.2006.1920

Lan, 2012, Using the RDP classifier to predict taxonomic novelty and reduce the search space for finding novel organisms, PLoS ONE, 7, e32491, 10.1371/journal.pone.0032491

Lee, 2011, rRNASelector: a computer program for selecting ribosomal RNA encoding sequences from metagenomic and metatranscriptomic shotgun libraries, J Microbiol, 49, 689, 10.1007/s12275-011-1213-z

McCarthy, 1963, An approach to the measurement of genetic relatedness among organisms, Proc Natl Acad Sci U S A, 50, 156, 10.1073/pnas.50.1.156

McDonald, 2012, An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea, ISME J, 6, 610, 10.1038/ismej.2011.139

Meier-Kolthoff, 2013, Genome sequence-based species delimitation with confidence intervals and improved distance functions, BMC Bioinformatics, 14, 60, 10.1186/1471-2105-14-60

Mende, 2013, Accurate and universal delineation of prokaryotic species, Nat Methods, 10, 881, 10.1038/nmeth.2575

Pei, 2010, Diversity of 16S rRNA genes within individual prokaryotic genomes, Appl Environ Microbiol, 76, 3886, 10.1128/AEM.02953-09

Read, 2011, Classifier chains for multi-label classification, Mach Learn, 85, 333, 10.1007/s10994-011-5256-5

Richter, 2009, Shifting the genomic gold standard for the prokaryotic species definition, Proc Natl Acad Sci U S A, 106, 19126, 10.1073/pnas.0906412106

Rosselló-Mora, 2001, The species concept for prokaryotes, FEMS Microbiol Rev, 25, 39, 10.1016/S0168-6445(00)00040-1

Schildkraut, 1961, The formation of hybrid DNA molecules and their use in studies of DNA homologies, J Mol Biol, 3, 595, 10.1016/S0022-2836(61)80024-7

Stackebrandt, 2006, Taxonomic parameters revisited: tarnished gold standards, Microbiol Today, 33, 152

Stackebrandt, 1994, Taxonomic note: a place for DNA-DNA reassociation and 16S rRNA sequence analysis in the present species definition in bacteriology, Int J Syst Bacteriol, 44, 846, 10.1099/00207713-44-4-846

Stackebrandt, 2002, Report of the ad hoc committee for the re-evaluation of the species definition in bacteriology, Int J Syst Evol Microbiol, 52, 1043

Tindall, 2010, Notes on the characterization of prokaryote strains for taxonomic purposes, Int J Syst Evol Microbiol, 60, 249, 10.1099/ijs.0.016949-0

van Rijsbergen, 1979, Information Retrieval

Wayne, 1987, Report of the ad hoc committee on reconciliation of approaches to bacterial systematics, Int J Syst Bacteriol, 37, 463, 10.1099/00207713-37-4-463

Yi, 2012, Comparative genomics of Neisseria weaveri clarifies the taxonomy of this species and identifies genetic determinants that may be associated with virulence, FEMS Microbiol Lett, 328, 100, 10.1111/j.1574-6968.2011.02485.x