Large-scale generation and analysis of filamentous fungal DNA barcodes boosts coverage for kingdom fungi and reveals thresholds for fungal species and higher taxon delimitation

Studies in Mycology - Tập 92 Số 1 - Trang 135-154 - 2019
Duong Vu1, Marizeth Groenewald1, Michèl de Vries1, Thies Gehrmann1, J. Benjamin Stielow1, Ursula Eberhardt2, Abdullah M. S. Al‐Hatmi1, J.Z. Groenewald1, Gianluigi Cardinali3, Jos Houbraken1, Teun Boekhout4,1, P.W. Crous5,6,1, Vincent Robert1, G.J.M. Verkley1
1Westerdijk Fungal Biodiversity Institute, Uppsalalaan 8, 3584 CT, Utrecht, The Netherlands
2Staatliches Museum f. Naturkunde Stuttgart, Abt. Botanik, Rosenstein 1, D-70191 Stuttgart, Germany;
3University of Perugia, Dept. of Pharmaceutical Sciences, Via Borgo 20 Giugno 74, I 06121 Perugia, Italy;
4Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, The Netherlands
5Department of Genetics, Biochemistry and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria 0028, South Africa
6Wageningen University and Research Centre (WUR), Laboratory of Phytopathology, Droevendaalsesteeg 1, 6708 PB Wageningen, The Netherlands

Tóm tắt

Species identification lies at the heart of biodiversity studies that has in recent years favoured DNA-based approaches. Microbial Biological Resource Centres are a rich source for diverse and high-quality reference materials in microbiology, and yet the strains preserved in these biobanks have been exploited only on a limited scale to generate DNA barcodes. As part of a project funded in the Netherlands to barcode specimens of major national biobanks, sequences of two nuclear ribosomal genetic markers, the Internal Transcribed Spaces and 5.8S gene (ITS) and the D1/D2 domain of the 26S Large Subunit (LSU), were generated as DNA barcode data for ca. 100 000 fungal strains originally assigned to ca. 17 000 species in the CBS fungal biobank maintained at the Westerdijk Fungal Biodiversity Institute, Utrecht. Using more than 24 000 DNA barcode sequences of 12 000 ex-type and manually validated filamentous fungal strains of 7 300 accepted species, the optimal identity thresholds to discriminate filamentous fungal species were predicted as 99.6 % for ITS and 99.8 % for LSU. We showed that 17 % and 18 % of the species could not be discriminated by the ITS and LSU genetic markers, respectively. Among them, ∼8 % were indistinguishable using both genetic markers. ITS has been shown to outperform LSU in filamentous fungal species discrimination with a probability of correct identification of 82 % vs. 77.6 %, and a clustering quality value of 84 % vs. 77.7 %. At higher taxonomic classifications, LSU has been shown to have a better discriminatory power than ITS. With a clustering quality value of 80 %, LSU outperformed ITS in identifying filamentous fungi at the ordinal level. At the generic level, the clustering quality values produced by both genetic markers were low, indicating the necessity for taxonomic revisions at genus level and, likely, for applying more conserved genetic markers or even whole genomes. The taxonomic thresholds predicted for filamentous fungal identification at the genus, family, order and class levels were 94.3 %, 88.5 %, 81.2 % and 80.9 % based on ITS barcodes, and 98.2 %, 96.2 %, 94.7 % and 92.7 % based on LSU barcodes. The DNA barcodes used in this study have been deposited to GenBank and will also be publicly available at the Westerdijk Institute's website as reference sequences for fungal identification, marking an unprecedented data release event in global fungal barcoding efforts to date.

Từ khóa


Tài liệu tham khảo

Afshinnekoo, 2015, Geospatial Resolution of Human and Bacterial Diversity with City-Scale Metagenomics, Cell Systems, 1, 72, 10.1016/j.cels.2015.01.001

Altschul, 1997, Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Research, 25, 3389, 10.1093/nar/25.17.3389

Blaalid, 2013, ITS1 versus ITS2 as DNA metabarcodes for fungi, Molecular Ecology Resources, 13, 218, 10.1111/1755-0998.12065

Boon, 2010, Intra-isolate genome variation in arbuscular mycorrhizal fungi persists in the transcriptome, Journal of Evolutionary Biology, 23, 1519, 10.1111/j.1420-9101.2010.02019.x

Botschuijver, 2017, Intestinal Fungal Dysbiosis Associates With Visceral Hypersensitivity in Patients With Irritable Bowel Syndrome and Rats, Gastroenterology, 153, 1026, 10.1053/j.gastro.2017.06.004

CBOL Plant Working Group, 2009, A DNA barcode for land plants, PNAS, 106, 12794, 10.1073/pnas.0905845106

Chen, 2017, Didymellaceae revisited, Studies in Mycology, 87, 105, 10.1016/j.simyco.2017.06.002

Coissac, 2016, From barcodes to genomes: Extending the concept of DNA barcoding, Molecular Ecology, 25, 1423, 10.1111/mec.13549

Cui, 2013, The human mycobiome in health and disease, Genome Medicine, 5, 1, 10.1186/gm467

De Queiroz, 2007, Species concepts and species delimitation, Systematic Botany, 56, 879

Dujon, 2004, Genome evolution in yeasts, Nature, 430, 35, 10.1038/nature02579

Eberhardt, 2012, Methods for DNA barcoding of fungi, DNA Barcodes: Methods and Protocols, 858, 183, 10.1007/978-1-61779-591-6_9

Edgar, 2018, Updating the 97 % identity threshold for 16S ribosomal RNA OTUs, Bioinformatics, 10.1093/bioinformatics/bty113

Federhen, 2015, Type material in the NCBI Taxonomy Database, Nucleic Acids Research, 43, D1086, 10.1093/nar/gku1127

Fell, 2000, Biodiversity and systematics of basidiomycetous yeasts as determined by large-subunit rDNA D1 / D2 domain sequence analysis, International Journal of Systematic and Evolutionary Microbiology, 50, 1351, 10.1099/00207713-50-3-1351

Fuhrman, 2009, Microbial community structure and its functional implications, Nature, 459, 193, 10.1038/nature08058

Galagan, 2005, Genomics of the fungal kingdom: insights into eukaryotic biology, Genome Research, 15, 1620, 10.1101/gr.3767105

Garza, 2016, Bottom-up ecology of the human microbiome: from metagenomes to metabolomes, BioRxiv

Geml, 2014, The contribution of DNA metabarcoding to fungal conservation: diversity assessment, habitat partitioning and mapping red-listed fungi in protected coastal Salix repens communities in the Netherlands, PLoS One, 9, e99852, 10.1371/journal.pone.0099852

Gweon, 2015, PIPITS: an automated pipeline for analyses of fungal internal transcribed spacer sequences from the Illumina sequencing platform, Methods in Ecology and Evolution, 6, 973, 10.1111/2041-210X.12399

Handelsman, 2004, Metagenomics: Application of Genomics to Uncultured Microorganisms, Microbiology and Molecular Biology Reviews, 68, 669, 10.1128/MMBR.68.4.669-685.2004

Hawksworth, 2017, Fungal diversity revisited: 2.2 to 3.8 million species, Microbiology Spectrum, 5, 1, 10.1128/microbiolspec.FUNK-0052-2016

Hebert, 2003, Biological identifications through DNA barcodes, Proceedings of the Royal Society B, 270, 313, 10.1098/rspb.2002.2218

Hibbett, 2016, The invisible dimension of fungal diversity, Science, 351, 1150, 10.1126/science.aae0380

Houbraken, 2011, Phylogeny of Penicillium and the segregation of Trichocomaceae into three families, Studies in Mycology, 70, 1, 10.3114/sim.2011.70.01

Huttenhower, 2012, Structure, function and diversity of the healthy human microbiome, Nature, 486, 207, 10.1038/nature11234

Irinyi, 2015, International Society of Human and Animal Mycology (ISHAM)-ITS reference DNA barcoding database – The quality controlled standard tool for routine identification of human and animal pathogenic fungi, Medical Mycology, 53, 313, 10.1093/mmy/myv008

Kellis, 2004, Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae, Nature, 428, 617, 10.1038/nature02424

Kim, 2014, Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes, International Journal of Systematic and Evolutionary Microbiology, 64, 346, 10.1099/ijs.0.059774-0

Kiss, 2012, Limits of nuclear ribosomal DNA internal transcribed spacer (ITS) sequences as species barcodes for Fungi, PNAS, 109, 10.1073/pnas.1207143109

Koljalg, 2013, Towards a unified paradigm for sequence-based identification of fungi, Molecular Ecology, 22, 5271, 10.1111/mec.12481

Kooij, 2015, Evolutionarily advanced ant farmers rear polyploid fungal crops, Journal of Evolutionary Biology, 28, 1911, 10.1111/jeb.12718

Kurtzman, 1998, Identification and phylogeny of ascomycetous yeasts from analysis of nuclear large subunit (26S) ribosomal DNA partial sequences, Antonie Van Leeuwenhoek, 73, 331, 10.1023/A:1001761008817

Levy, 2017, Microbiome, metabolites and host immunity, Current Opinion in Microbiology, 35, 8, 10.1016/j.mib.2016.10.003

Liu, 2016, Phylogeny of tremellomycetous yeasts and related dimorphic and filamentous basidiomycetes reconstructed from multiple gene sequence analyses, Studies in Mycology, 81, 1, 10.1016/j.simyco.2015.08.001

Mau, 2014, Linking soil bacterial biodiversity and soil carbon stability, The ISME Journal, 9, 1477, 10.1038/ismej.2014.205

Mohanta, 2015, The diversity of fungal genome, Biological Procedures Online, 17, 8, 10.1186/s12575-015-0020-z

Nguyen, 2015, The lung mycobiome: An emerging field of the human respiratory microbiome, Frontiers in Microbiology, 6, 1, 10.3389/fmicb.2015.00089

Nilsson, 2008, Intraspecific ITS variability in the Kingdom Fungi as expressed in the international sequence databases and its implications for molecular species identification, Evolutionary Bioinformatics, 4, 193, 10.4137/EBO.S653

Nilsson, 2006, Taxonomic reliability of DNA sequences in public sequences databases: A fungal perspective, PLoS One, 1, e59, 10.1371/journal.pone.0000059

Nilsson, 2016, Top 50 most wanted fungi, MycoKeys, 12, 29, 10.3897/mycokeys.12.7553

Paccanaro, 2006, Spectral clustering of protein sequences, Nucleic Acids Research, 34, 1571, 10.1093/nar/gkj515

Quandt, 2015, Metagenome sequence of Elaphomyces granulatus from sporocarp tissue reveals Ascomycota ectomycorrhizal fingerprints of genome expansion and a Proteobacteria-rich microbiome, Environmental Microbiology, 17, 2952, 10.1111/1462-2920.12840

Robbertse, 2017, Improving taxonomic accuracy for fungi in public sequence databases: applying 'one name one species' in well-defined genera with Trichoderma/Hypocrea as a test case, Database, 10.1093/database/bax072

Robert, 2011, BioloMICS Software: Biological data management, identification, classification and statistics, The Open Applied Informatics Journal, 5, 87, 10.2174/1874136301005010087

Robert, 2013, MycoBank gearing up for new horizons, IMA Fungus, 4, 371, 10.5598/imafungus.2013.04.02.16

Schoch, 2012, Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi, PNAS, 109, 1, 10.1073/pnas.1117018109

Simon, 2008, Intragenomic variation of fungal ribosomal genes is higher than previously thought, Molecular Biology and Evolution, 25, 2251, 10.1093/molbev/msn188

Stackebrandt, 2006, Taxonomic parameters revisited: tarnished gold standards, Microbiology Today, 33, 152

Stielow, 2015, One fungus, which genes? Development and assessment of universal primers for potential secondary fungal DNA barcodes, Persoonia, 35, 242, 10.3767/003158515X689135

Strope, 2015, The 100-genomes strains, an S. cerevisiae resource that illuminates its natural phenotypic and genotypic variation and emergence as an opportunistic pathogen, Genome Research, 125, 762, 10.1101/gr.185538.114

Tang, 2016, Visualizing Large-scale and High-dimensional Data, 287

Tedersoo, 2014, Global diversity and geography of soil fungi, Science, 346, 1256688, 10.1126/science.1256688

UNITE Community, 2017

Verkley, 2013, A new approach to species delimitation in Septoria, Studies in Mycology, 75, 213, 10.3114/sim0018

Videira, 2017, Mycosphaerellaceae – chaos or clarity?, Studies in Mycology, 87, 257, 10.1016/j.simyco.2017.09.003

Vu, 2018, fMLC: Fast multi-level clustering and visualization of large molecular datasets, Bioinformatics, 34, 1577, 10.1093/bioinformatics/btx810

Vu, 2016, DNA barcoding analysis of more than 9000 yeast isolates contributes to quantitative thresholds for yeast species and genera delimitation, Studies in Mycology, 85, 91, 10.1016/j.simyco.2016.11.007

Vu, 2014, Massive fungal biodiversity data re-annotation with multi-level clustering, Scientific Reports, 4, 6837, 10.1038/srep06837

Vu, 2012, A laboratory information management system for DNA barcoding workflows, Integrative Biology, 4, 744, 10.1039/c2ib00146b

Wang, 2016, Multigene phylogeny and taxonomic revision of yeasts and related fungi in the Ustilaginomycotina, Studies in Mycology, 81, 55, 10.1016/j.simyco.2015.10.004

Wang, 2016, Multigene phylogeny and reclassification of yeasts and related filamentous taxa in Basidiomycota, Studies in Mycology, 81, 27, 10.1016/j.simyco.2015.08.002

Woudenberg, 2013, Alternaria redefined, Studies in Mycology, 75, 171, 10.3114/sim0015

Woudenberg, 2015, Alternaria section Alternaria: Species, formae speciales or pathotypes?, Studies in Mycology, 82, 1, 10.1016/j.simyco.2015.07.001

Yang, 2017, Families, genera, and species of Botryosphaeriales, Fungal Biology, 121, 322, 10.1016/j.funbio.2016.11.001

Yarza, 2014, Uniting the classification of cultured and uncultured bacteria and archaea using 16S rRNA gene sequences, Nature Reviews Microbiology, 12, 635, 10.1038/nrmicro3330