antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline

Nucleic Acids Research - Tập 47 Số W1 - Trang W81-W87 - 2019
Kai Blin1, Simon J. Shaw1, Kat Steinke2, Rasmus Villebro1, Nadine Ziemert2, Sang Yup Lee3,1, Marnix H. Medema4, Tilmann Weber1
1The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet bygning 220, 2800 Kgs. Lyngby, Denmark
2German Centre for Infection Research (DZIF), Interfaculty Institute of Microbiology and Infection Medicine, Auf der Morgenstelle 28, University of Tübingen, 72076 Tübingen, DE, Germany
3Department of Chemical and Biomolecular Engineering (BK21 Plus Program) and BioInformatics Research Center, Korea Advanced Institute of Science and Technology, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, South Korea
4Bioinformatics Group, Wageningen University, Droevendaalsesteeg 1, 6708PB, Wageningen, The Netherlands

Tóm tắt

Abstract Secondary metabolites produced by bacteria and fungi are an important source of antimicrobials and other bioactive compounds. In recent years, genome mining has seen broad applications in identifying and characterizing new compounds as well as in metabolic engineering. Since 2011, the ‘antibiotics and secondary metabolite analysis shell—antiSMASH’ (https://antismash.secondarymetabolites.org) has assisted researchers in this, both as a web server and a standalone tool. It has established itself as the most widely used tool for identifying and analysing biosynthetic gene clusters (BGCs) in bacterial and fungal genome sequences. Here, we present an entirely redesigned and extended version 5 of antiSMASH. antiSMASH 5 adds detection rules for clusters encoding the biosynthesis of acyl-amino acids, β-lactones, fungal RiPPs, RaS-RiPPs, polybrominated diphenyl ethers, C-nucleosides, PPY-like ketones and lipolanthines. For type II polyketide synthase-encoding gene clusters, antiSMASH 5 now offers more detailed predictions. The HTML output visualization has been redesigned to improve the navigation and visual representation of annotations. We have again improved the runtime of analysis steps, making it possible to deliver comprehensive annotations for bacterial genomes within a few minutes. A new output file in the standard JavaScript object notation (JSON) format is aimed at downstream tools that process antiSMASH results programmatically.

Từ khóa


Tài liệu tham khảo

Newman, 2016, Natural products as sources of new drugs from 1981 to 2014, J. Nat. Prod., 79, 629, 10.1021/acs.jnatprod.5b01055

van der Meij, 2017, Chemical ecology of antibiotic production by actinomycetes, FEMS Microbiol. Rev., 41, 392, 10.1093/femsre/fux005

Ziemert, 2016, The evolution of genome mining in microbes - a review, Nat. Prod. Rep., 33, 988, 10.1039/C6NP00025H

Weber, 2009, CLUSEAN: a computer-based framework for the automated analysis of bacterial secondary metabolite biosynthetic gene clusters, J. Biotechnol., 140, 13, 10.1016/j.jbiotec.2009.01.007

Skinnider, 2017, PRISM 3: expanded prediction of natural product chemical structures from microbial genomes, Nucleic Acids Res., 45, W49, 10.1093/nar/gkx320

Medema, 2011, antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences, Nucleic Acids Res., 39, W339, 10.1093/nar/gkr466

Blin, 2013, antiSMASH 2.0—a versatile platform for genome mining of secondary metabolite producers, Nucleic Acids Res., 41, W204, 10.1093/nar/gkt449

Weber, 2015, antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters, Nucleic Acids Res., 43, W237, 10.1093/nar/gkv437

Blin, 2017, antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification, Nucleic Acids Res., 45, W36, 10.1093/nar/gkx319

Kautsar, 2017, plantiSMASH: automated identification, annotation and expression analysis of plant biosynthetic gene clusters, Nucleic Acids Res., 45, W55, 10.1093/nar/gkx305

Blin, 2014, Improved lanthipeptide detection and prediction for antiSMASH, PLoS One, 9, e89420, 10.1371/journal.pone.0089420

Villebro, 2019, Sequence-based classification of type II polyketide synthase biosynthetic gene clusters for antiSMASH, J. Ind. Microbiol. Biotechnol., 46, 469, 10.1007/s10295-018-02131-9

Blin, 2017, The antiSMASH database, a comprehensive database of microbial secondary metabolite biosynthetic gene clusters, Nucleic Acids Res., 45, D555, 10.1093/nar/gkw960

Blin, 2019, The antiSMASH database version 2: a comprehensive resource on secondary metabolite biosynthetic gene clusters, Nucleic Acids Res., 47, D625, 10.1093/nar/gky1060

Medema, 2014, Pep2Path: automated mass spectrometry-guided genome mining of peptidic natural products, PLoS Comput. Biol., 10, e1003822, 10.1371/journal.pcbi.1003822

Alanjary, 2017, The Antibiotic Resistant Target Seeker (ARTS), an exploration engine for antibiotic cluster prioritization and novel drug target discovery, Nucleic Acids Res., 45, W42, 10.1093/nar/gkx360

Blin, 2016, CRISPy-web: An online resource to design sgRNAs for CRISPR applications, Synth. Syst. Biotechnol., 1, 118, 10.1016/j.synbio.2016.01.003

Shirley, 2018, Unzipping natural products: improved natural product structure predictions by ensemble modeling and fingerprint matching, 10.26434/chemrxiv.6863864.v1

Navarro-Muñoz, 2018, A computational framework for systematic exploration of biosynthetic diversity from large-scale genomic data, 10.1101/445270

Finn, 2016, The Pfam protein families database: towards a more sustainable future, Nucleic Acids Res., 44, D279, 10.1093/nar/gkv1344

Letunic, 2018, 20 years of the SMART protein domain annotation resource, Nucleic Acids Res., 46, D493, 10.1093/nar/gkx922

de Jong, 2010, BAGEL2: mining for bacteriocins in genomic data, Nucleic Acids Res., 38, W647, 10.1093/nar/gkq365

Yadav, 2009, Towards prediction of metabolic products of polyketide synthases: an in silico analysis, PLoS Comput. Biol., 5, e1000351, 10.1371/journal.pcbi.1000351

Craig, 2011, Long-chain N-acyl amino acid synthases are linked to the putative PEP-CTERM/exosortase protein-sorting system in Gram-negative bacteria, J. Bacteriol., 193, 5707, 10.1128/JB.05426-11

Robinson, 2018, Biosynthesis and chemical diversity of β-lactone natural products, Nat. Prod. Rep., 36, 458, 10.1039/C8NP00052B

Agarwal, 2017, Metagenomic discovery of polybrominated diphenyl ether biosynthesis by marine sponges, Nat. Chem. Biol., 13, 537, 10.1038/nchembio.2330

Sosio, 2018, Analysis of the Pseudouridimycin biosynthetic pathway provides Insights into the formation of C-nucleoside antibiotics, Cell Chem. Biol., 25, 540, 10.1016/j.chembiol.2018.02.008

Bauer, 2015, Biosynthetic origin of the antibiotic pseudopyronines A and B in Pseudomonas putida BW11M1, Chembiochem, 16, 2491, 10.1002/cbic.201500413

Luo, 2012, Ribosomal biosynthesis of α-amanitin in Galerina marginata, Fungal Genet. Biol., 49, 123, 10.1016/j.fgb.2011.12.005

Nagano, 2016, Class of cyclic ribosomal peptide synthetic genes in filamentous fungi, Fungal Genet. Biol., 86, 58, 10.1016/j.fgb.2015.12.010

Ding, 2016, Biosynthetic investigation of phomopsins reveals a widespread pathway for ribosomal natural products in Ascomycetes, Proc. Natl. Acad. Sci. U.S.A., 113, 3521, 10.1073/pnas.1522907113

Bushin, 2018, Charting an unexplored streptococcal biosynthetic landscape reveals a unique peptide cyclization motif, J. Am. Chem. Soc., 140, 17674, 10.1021/jacs.8b10266

Caruso, 2019, A radical approach to enzymatic β-Thioether bond formation, J. Am. Chem. Soc., 141, 990, 10.1021/jacs.8b11060

Gibson, 2015, Improved annotation of antibiotic resistance determinants reveals microbial resistomes cluster by ecology, ISME J., 9, 207, 10.1038/ismej.2014.106

Yi, 2007, Identifying clusters of functionally related genes in genomes, Bioinformatics, 23, 1053, 10.1093/bioinformatics/btl673

Inglis, 2013, Comprehensive annotation of secondary metabolite biosynthetic genes and gene clusters of Aspergillus nidulans, A. fumigatus, A. niger and A. oryzae, BMC Microbiol., 13, 91, 10.1186/1471-2180-13-91

The Gene Ontology Consortium, 2016, Expansion of the Gene Ontology knowledgebase and resources, Nucleic Acids Res., 45, D331, 10.1093/nar/gkw1108

Medema, 2015, Minimum information about a biosynthetic gene cluster, Nat. Chem. Biol., 11, 625, 10.1038/nchembio.1890

Medema, 2013, Detecting sequence homology at the gene cluster level with MultiGeneBlast, Mol. Biol. Evol., 30, 1218, 10.1093/molbev/mst025

Baltz, 2018, Natural product drug discovery in the genomic era: realities, conjectures, misconceptions, and opportunities, J. Ind. Microbiol. Biotechnol., 46, 281, 10.1007/s10295-018-2115-4

Cimermancic, 2014, Insights into secondary metabolism from a global analysis of prokaryotic biosynthetic gene clusters, Cell, 158, 412, 10.1016/j.cell.2014.06.034

Blin, 2017, Recent development of antiSMASH and other computational approaches to mine secondary metabolite biosynthetic gene clusters, Brief. Bioinform.