EzTaxon: a web-based tool for the identification of prokaryotes based on 16S ribosomal RNA gene sequences

International Journal of Systematic and Evolutionary Microbiology - Tập 57 Số 10 - Trang 2259-2261 - 2007
Jongsik Chun1,2, Jae‐Hak Lee1, Yoonyoung Jung1, Myung-Jin Kim2, Seil Kim2, Byung Kwon Kim2, Young-Woon Lim2
1Interdisciplinary Program in Bioinformatics, Seoul National University, 56-1 Shillim-dong, Kwanak-gu, Seoul 151-742, Republic of Korea
2School of Biological Sciences and Institute of Microbiology, Seoul National University, 56-1 Shillim-dong, Kwanak-gu, Seoul 151-742, Republic of Korea

Tóm tắt

16S rRNA gene sequences have been widely used for the identification of prokaryotes. However, the flood of sequences of non-type strains and the lack of a peer-reviewed database for 16S rRNA gene sequences of type strains have made routine identification of isolates difficult and labour-intensive. In the present study, we generated a database containing 16S rRNA gene sequences of all prokaryotic type strains. In addition, a web-based tool, named EzTaxon, for analysis of 16S rRNA gene sequences was constructed to achieve identification of isolates based on pairwise nucleotide similarity values and phylogenetic inference methods. The system developed provides users with a similarity-based search, multiple sequence alignment and various phylogenetic analyses. All of these functions together with the 16S rRNA gene sequence database of type strains can be successfully used for automated and reliable identification of prokaryotic isolates. The EzTaxon server is freely accessible over the Internet at http://www.eztaxon.org/

Từ khóa


Tài liệu tham khảo

Altschul, 1997, Gapped blast and psi-blast: a new generation of protein database search programs, Nucleic Acids Res, 25, 3389, 10.1093/nar/25.17.3389

Ewing, 1998, Base-calling of automated sequencer traces using phred . II, Error probabilities. Genome Res, 8, 186, 10.1101/gr.8.3.186

Ewing, 1998, Base-calling of automated sequencer traces using phred . I, Accuracy assessment. Genome Res, 8, 175, 10.1101/gr.8.3.175

Felsenstein, 1981, Evolutionary trees from DNA sequences: a maximum likelihood approach, J Mol Evol, 17, 368, 10.1007/BF01734359

Felsenstein, 2005, phylip (Phylogeny Inference Package), version 3.6. Distributed by the author. Department of Genome Sciences

Fitch, 1971, Toward defining the course of evolution: minimum change for a specific tree topology, Syst Zool, 20, 406, 10.2307/2412116

Jeon, 2005, jphydit: a JAVA-based integrated environment for molecular phylogeny of ribosomal RNA sequences, Bioinformatics, 21, 3171, 10.1093/bioinformatics/bti463

Kumar, 2004, mega3: Integrated software for molecular evolutionary genetics analysis and sequence alignment, Brief Bioinform, 5, 150, 10.1093/bib/5.2.150

Myers, 1988, Optimal alignments in linear space, Comput Appl Biosci, 4, 11

Ronquist, 2003, MrBayes 3: bayesian phylogenetic inference under mixed models, Bioinformatics, 19, 1572, 10.1093/bioinformatics/btg180

Rosselló-Mora, 2001, The species concept for prokaryotes, FEMS Microbiol Rev, 25, 39, 10.1016/S0168-6445(00)00040-1

Saitou, 1987, The neighbor-joining method: a new method for reconstructing phylogenetic trees, Mol Biol Evol, 4, 406

Stackebrandt, 2006, Taxonomic parameters revisited: tarnished gold standards, Microbiol Today, 33, 152

Stackebrandt, 1994, Taxonomic note: a place for DNA-DNA reassociation and 16S rRNA sequence analysis in the present species definition in bacteriology, Int J Syst Bacteriol, 44, 846, 10.1099/00207713-44-4-846

Swofford, 2002, paup*: Phylogenetic analysis using parsimony (*and other methods), version 4

Thompson, 1994, clustal w: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice, Nucleic Acids Res, 22, 4673, 10.1093/nar/22.22.4673