Using SPAdes De Novo Assembler

Current Protocols in Bioinformatics - Tập 70 Số 1 - 2020
Andrey D. Prjibelski1, Dmitry Antipov1, Dmitry Meleshko1, Alla Lapidus1,2, Anton Korobeynikov1,3
1Center for Algorithmic Biotechnologies Saint Petersburg State University Saint Petersburg Russia
2Department of Cytology and Histology, Saint Petersburg State University, Saint Petersburg, Russia
3Department of Statistical Modelling Saint Petersburg State University Saint Petersburg Russia

Tóm tắt

AbstractSPAdes—St. Petersburg genome Assembler—was originally developed for de novo assembly of genome sequencing data produced for cultivated microbial isolates and for single‐cell genomic DNA sequencing. With time, the functionality of SPAdes was extended to enable assembly of IonTorrent data, as well as hybrid assembly from short and long reads (PacBio and Oxford Nanopore). In this article we present protocols for five different assembly pipelines that comprise the SPAdes package and that are used for assembly of metagenomes and transcriptomes as well as assembly of putative plasmids and biosynthetic gene clusters from whole‐genome sequencing and metagenomic datasets. In addition, we present guidelines for understanding results with use cases for each pipeline, and several additional support protocols that help in using SPAdes properly. © 2020 Wiley Periodicals LLC.Basic Protocol 1: Assembling isolate bacterial datasetsBasic Protocol 2: Assembling metagenomic datasetsBasic Protocol 3: Assembling sets of putative plasmidsBasic Protocol 4: Assembling transcriptomesBasic Protocol 5: Assembling putative biosynthetic gene clustersSupport Protocol 1: Installing SPAdesSupport Protocol 2: Providing input via command lineSupport Protocol 3: Providing input data via YAML formatSupport Protocol 4: Restarting previous runSupport Protocol 5: Determining strand‐specificity of RNA‐seq data

Từ khóa


Tài liệu tham khảo

Antipov D., 2016, PlasmidSPAdes: Assembling plasmids from whole genome sequencing data, Bioinformatics, 32, 3380, 10.1093/bioinformatics/btw493

10.1093/bioinformatics/btv688

10.1101/gr.241299.118

10.1093/gigascience/giz100

10.1093/bioinformatics/btw218

10.1371/journal.pone.0185056

10.1093/bioinformatics/btt310

10.1093/bioinformatics/bts635

10.1089/cmb.2018.0152

10.1093/bioinformatics/btt086

10.1093/gigascience/giz039

10.1038/nmeth.3317

10.1016/j.mib.2007.08.005

10.1093/bioinformatics/btp352

10.1093/bioinformatics/btt273

10.1101/gr.243477.118

10.1093/bioinformatics/btv697

10.1093/bioinformatics/bts277

10.1186/1471-2164-14-S1-S7

10.1089/cmb.2013.0084

10.1101/gr.213959.116

Prjibelski A. D., 2019, Extending rnaSPAdes functionality for hybrid transcriptome assembly, BMC Bioinformatics, 20, 2

10.1093/bioinformatics/btu266

10.1016/B978-0-12-809633-8.20106-4

10.1101/gr.196469.115

10.1093/bioinformatics/btv337

10.1093/bioinformatics/bts356

10.1093/bioinformatics/btv383

Antipovet al. (2015). See above.

Antipovet al. (2016). See above.

Antipovet al. (2019). See above.

Bushmanovaet al. (2019). See above.

Bushmanovaet al. (2020). See above.

Ershovet al. (2019). See above.

Meleshkoet al. (2019). See above.

Nikolenkoet al. (2013). See above.

Nurket al. (2013). See above.

Nurket al. (2017). See above.

Prjibelskiet al. (2014). See above.

Vasilinetcet al. (2015). See above.

http://cab.spbu.ru/software/spades/

https://github.com/ablab/spades