RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies
Tóm tắt
Motivation: Phylogenies are increasingly used in all fields of medical and biological research. Moreover, because of the next-generation sequencing revolution, datasets used for conducting phylogenetic analyses grow at an unprecedented pace. RAxML (Randomized Axelerated Maximum Likelihood) is a popular program for phylogenetic analyses of large datasets under maximum likelihood. Since the last RAxML paper in 2006, it has been continuously maintained and extended to accommodate the increasingly growing input datasets and to serve the needs of the user community.
Results: I present some of the most notable new features and extensions of RAxML, such as a substantial extension of substitution models and supported data types, the introduction of SSE3, AVX and AVX2 vector intrinsics, techniques for reducing the memory requirements of the code and a plethora of operations for conducting post-analyses on sets of trees. In addition, an up-to-date 50-page user manual covering all new RAxML options is available.
Availability and implementation: The code is available under GNU GPL at https://github.com/stamatak/standard-RAxML.
Contact: [email protected]
Supplementary information: Supplementary data are available at Bioinformatics online.
Từ khóa
Tài liệu tham khảo
Aberer, 2010, Parallelized phylogenetic post-analysis on multi-core architectures, J. Comput. Sci., 1, 107, 10.1016/j.jocs.2010.03.006
Berger, 2010, Accuracy of morphology-based phylogenetic fossil placement under maximum likelihood, International Conference on Computer Systems and Applications (AICCSA), 2010 IEEE/ACS, 1
Berger, 2011, Performance, accuracy, and web server for evolutionary placement of short sequence reads under maximum likelihood, Syst. Biol., 60, 291, 10.1093/sysbio/syr010
Guindon, 2010, New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of phyml 3.0, Syst. Biol., 59, 307, 10.1093/sysbio/syq010
Izquierdo-Carrasco, 2011, Algorithms, data structures, and numerics for likelihood-based phylogenetic inference of huge trees, BMC Bioinformatics, 12, 470, 10.1186/1471-2105-12-470
Le, 2012, Modeling protein evolution with several amino acid replacement matrices depending on site rates, Mol. Biol. Evol., 29, 2921, 10.1093/molbev/mss112
Lewis, 2001, A likelihood approach to estimating phylogeny from discrete morphological character data, Syst. Biol., 50, 913, 10.1080/106351501753462876
Minh, 2013, Ultrafast approximation for phylogenetic bootstrap, Mol. Biol Evol., 30, 1188, 10.1093/molbev/mst024
Pattengale, 2010, How many bootstrap replicates are necessary?, J. Comput. Biol., 17, 337, 10.1089/cmb.2009.0179
Pattengale, 2011, Uncovering hidden phylogenetic consensus in large data sets, IEEE/ACM Trans. Comput. Biol. Bioinforma., 8, 902, 10.1109/TCBB.2011.28
Pfeiffer, 2010, Hybrid mpi/pthreads parallelization of the raxml phylogenetics code, International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE, 1, 10.1109/IPDPSW.2010.5470900
Salichos, 2013, Inferring ancient divergences requires genes with strong phylogenetic signals, Nature, 497, 327, 10.1038/nature12130
Stamatakis, 2006, Raxml-vi-hpc: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models, Bioinformatics, 22, 2688, 10.1093/bioinformatics/btl446
Stamatakis, 2013, Novel parallelization schemes for large-scale likelihood-based phylogenetic inference, IEEE 27th International Symposium on Parallel Distributed Processing (IPDPS), 2013, 1195