RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies

Bioinformatics - Tập 30 Số 9 - Trang 1312-1313 - 2014
Alexandros Stamatakis1
11 Scientific Computing Group, Heidelberg Institute for Theoretical Studies, 69118 Heidelberg and 2Department of Informatics, Institute of Theoretical Informatics, Karlsruhe Institute of Technology, 76128 Karlsruhe, Germany

Tóm tắt

Abstract

Motivation: Phylogenies are increasingly used in all fields of medical and biological research. Moreover, because of the next-generation sequencing revolution, datasets used for conducting phylogenetic analyses grow at an unprecedented pace. RAxML (Randomized Axelerated Maximum Likelihood) is a popular program for phylogenetic analyses of large datasets under maximum likelihood. Since the last RAxML paper in 2006, it has been continuously maintained and extended to accommodate the increasingly growing input datasets and to serve the needs of the user community.

Results: I present some of the most notable new features and extensions of RAxML, such as a substantial extension of substitution models and supported data types, the introduction of SSE3, AVX and AVX2 vector intrinsics, techniques for reducing the memory requirements of the code and a plethora of operations for conducting post-analyses on sets of trees. In addition, an up-to-date 50-page user manual covering all new RAxML options is available.

Availability and implementation: The code is available under GNU GPL at https://github.com/stamatak/standard-RAxML.

Contact:  [email protected]

Supplementary information:  Supplementary data are available at Bioinformatics online.

Từ khóa


Tài liệu tham khảo

Aberer, 2010, Parallelized phylogenetic post-analysis on multi-core architectures, J. Comput. Sci., 1, 107, 10.1016/j.jocs.2010.03.006

Berger, 2010, Accuracy of morphology-based phylogenetic fossil placement under maximum likelihood, International Conference on Computer Systems and Applications (AICCSA), 2010 IEEE/ACS, 1

Berger, 2011, Performance, accuracy, and web server for evolutionary placement of short sequence reads under maximum likelihood, Syst. Biol., 60, 291, 10.1093/sysbio/syr010

Guindon, 2010, New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of phyml 3.0, Syst. Biol., 59, 307, 10.1093/sysbio/syq010

Izquierdo-Carrasco, 2011, Algorithms, data structures, and numerics for likelihood-based phylogenetic inference of huge trees, BMC Bioinformatics, 12, 470, 10.1186/1471-2105-12-470

Le, 2012, Modeling protein evolution with several amino acid replacement matrices depending on site rates, Mol. Biol. Evol., 29, 2921, 10.1093/molbev/mss112

Lewis, 2001, A likelihood approach to estimating phylogeny from discrete morphological character data, Syst. Biol., 50, 913, 10.1080/106351501753462876

Minh, 2013, Ultrafast approximation for phylogenetic bootstrap, Mol. Biol Evol., 30, 1188, 10.1093/molbev/mst024

Pattengale, 2010, How many bootstrap replicates are necessary?, J. Comput. Biol., 17, 337, 10.1089/cmb.2009.0179

Pattengale, 2011, Uncovering hidden phylogenetic consensus in large data sets, IEEE/ACM Trans. Comput. Biol. Bioinforma., 8, 902, 10.1109/TCBB.2011.28

Pfeiffer, 2010, Hybrid mpi/pthreads parallelization of the raxml phylogenetics code, International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE, 1, 10.1109/IPDPSW.2010.5470900

Salichos, 2013, Inferring ancient divergences requires genes with strong phylogenetic signals, Nature, 497, 327, 10.1038/nature12130

Stamatakis, 2006, Raxml-vi-hpc: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models, Bioinformatics, 22, 2688, 10.1093/bioinformatics/btl446

Stamatakis, 2013, Novel parallelization schemes for large-scale likelihood-based phylogenetic inference, IEEE 27th International Symposium on Parallel Distributed Processing (IPDPS), 2013, 1195

Stamatakis, 2008, A rapid bootstrap algorithm for the raxml web servers, Syst. Biol., 57, 758, 10.1080/10635150802429642