Resolving Difficult Phylogenetic Questions: Why More Sequences Are Not Enough

PLoS Biology - Tập 9 Số 3 - Trang e1000602
Hervé Philippe1, Henner Brinkmann1, Dennis V. Lavrov2, D. Timothy J. Littlewood3, Michaël Manuel4, Gert Wörheide5,6, Denis Baurain7
1Département de Biochimie, Centre Robert-Cedergren, Université de Montréal, Montréal, Québec, Canada
2Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa, United States of America
3Department of Zoology, The Natural History Museum, London, United Kingdom
4Université Paris 6, UMR 7138 "Systématique, Adaptation, Evolution" UPMC CNRS IRD MHNH, Paris, France
5Department of Earth and Environmental Sciences, Ludwig-Maximilians-Universität München, München, Germany
6GeoBio-Center, Ludwig-Maximilians-Universität München, München, Germany
7Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Liège, Belgium

Tóm tắt

Từ khóa


Tài liệu tham khảo

H Gee, 2003, Evolution: ending incongruence., Nature, 425, 782, 10.1038/425782a

C. W Dunn, 2008, Broad phylogenomic sampling improves resolution of the animal tree of life., Nature, 452, 745, 10.1038/nature06614

H Philippe, 2009, Phylogenomics revives traditional views on deep animal relationships., Curr Biol, 19, 706, 10.1016/j.cub.2009.02.052

B Schierwater, 2009, Concatenated analysis sheds light on early metazoan evolution and fuels a modern “urmetazoon” hypothesis., PLoS Biol, 7, e1000020, 10.1371/journal.pbio.1000020

H Philippe, 1994, Can the Cambrian explosion be inferred through molecular phylogeny?, Development, 120, S15, 10.1242/dev.1994.Supplement.15

N Saitou, 1986, The number of nucleotides required to determine the branching order of three species, with special reference to the human-chimpanzee-gorilla divergence., J Mol Evol, 24, 189, 10.1007/BF02099966

E Mossel, 2005, How much can evolved characters tell us about the tree that generated them?, 384

J Felsenstein, 1978, Cases in which parsimony or compatibility methods will be positively misleading., Syst Zool, 27, 401, 10.2307/2412923

H Philippe, 1998, How good are deep phylogenetic trees?, Curr Opin Genet Dev, 8, 616, 10.1016/S0959-437X(98)80028-2

D Baurain, 2010, Current approaches to phylogenomic reconstruction., 17

J. H Degnan, 2009, Gene tree discordance, phylogenetic inference and the multispecies coalescent., Trends Ecol Evol, 24, 332, 10.1016/j.tree.2009.01.009

O Jeffroy, 2006, Phylogenomics: the beginning of incongruence?, Trends Genet, 22, 225, 10.1016/j.tig.2006.02.003

H Philippe, 2005, Phylogenomics., Annu Rev Ecol Evol Syst, 36, 541, 10.1146/annurev.ecolsys.35.112202.130205

L Liu, 2008, Estimating species trees using multiple-allele DNA sequence data., Evolution, 62, 2080, 10.1111/j.1558-5646.2008.00414.x

A Kuzniar, 2008, The quest for orthologs: finding the corresponding gene across genomes., Trends Genet, 24, 539, 10.1016/j.tig.2008.08.009

J Felsenstein, 2004, Inferring phylogenies

C Notredame, 2007, Recent evolutions of multiple sequence alignment algorithms., PLoS Comput Biol, 3, e123, 10.1371/journal.pcbi.0030123

W. M Fitch, 2000, Homology: a personal view on some of the problems., Trends Genet, 16, 227, 10.1016/S0168-9525(00)02005-9

S van Dongen, 2000, Graph clustering by flow simulation [PhD dissertation]

L Li, 2003, OrthoMCL: identification of ortholog groups for eukaryotic genomes., Genome Res, 13, 2178, 10.1101/gr.1224503

R. L Tatusov, 2001, The COG database: new developments in phylogenetic classification of proteins from complete genomes., Nucleic Acids Res, 29, 22, 10.1093/nar/29.1.22

F Schreiber, 2009, OrthoSelect: a protocol for selecting orthologous groups in phylogenomics., BMC Bioinformatics, 10, 219, 10.1186/1471-2105-10-219

L. B Koski, 2001, The closest BLAST hit is often not the nearest neighbor., J Mol Evol, 52, 540, 10.1007/s002390010184

K. M Haen, 2007, Glass sponges and bilaterian animals share derived mitochondrial genomic features: a common ancestry or parallel evolution?, Mol Biol Evol, 24, 1518, 10.1093/molbev/msm070

M Kobayashi, 1996, Early evolution of the Metazoa and phylogenetic status of diploblasts as inferred from amino acid sequence of elongation factor-1 alpha., Mol Phylogenet Evol, 5, 414, 10.1006/mpev.1996.0036

M Medina, 2001, Evaluating hypotheses of basal animal phylogeny using complete sequences of large and small subunit rRNA., Proc Natl Acad Sci U S A, 98, 9707, 10.1073/pnas.171316998

A Rokas, 2003, Conflicting phylogenetic signals at the base of the metazoan tree., Evol Dev, 5, 346, 10.1046/j.1525-142X.2003.03042.x

E. A Sperling, 2009, Phylogenetic-signal dissection of nuclear housekeeping genes supports the paraphyly of sponges and the monophyly of Eumetazoa., Mol Biol Evol, 26, 2261, 10.1093/molbev/msp148

N Galtier, 2007, A model of horizontal gene transfer and the bacterial phylogeny problem., Syst Biol, 56, 633, 10.1080/10635150701546231

C Brochier, 2002, Eubacterial phylogeny based on translational apparatus proteins., Trends Genet, 18, 1, 10.1016/S0168-9525(01)02522-7

S. L Dellaporta, 2006, Mitochondrial genome of Trichoplax adhaerens supports placozoa as the basal lower metazoan phylum., Proc Natl Acad Sci U S A, 103, 8751, 10.1073/pnas.0602076103

K. S Pick, 2010, Improved phylogenomic taxon sampling noticeably affects nonbilaterian relationships., Mol Biol Evol, 27, 1983, 10.1093/molbev/msq089

M. D Hendy, 1989, A framework for the quantitative study of evolutionary trees., Syst Zool, 38, 297, 10.2307/2992396

D Baurain, 2007, Lack of resolution in the animal phylogeny: closely spaced cladogeneses or undetected systematic errors?, Mol Biol Evol, 24, 6, 10.1093/molbev/msl137

D. M Hillis, 1998, Taxonomic sampling, phylogenetic accuracy, and investigator bias., Syst Biol, 47, 3, 10.1080/106351598260987

J. J Wiens, 2005, Can incomplete taxa rescue phylogenetic analyses from long-branch attraction?, Syst Biol, 54, 731, 10.1080/10635150500234583

D. J Zwickl, 2002, Increased taxon sampling greatly reduces phylogenetic error., Syst Biol, 51, 588, 10.1080/10635150290102339

A. R Lemmon, 2009, The effect of ambiguous data on phylogenetic estimates obtained by maximum likelihood and Bayesian inference., Syst Biol, 58, 130, 10.1093/sysbio/syp017

S Hartmann, 2008, Using ESTs for phylogenomics: can one accurately infer a phylogenetic tree from a gappy alignment?, BMC Evol Biol, 8, 95, 10.1186/1471-2148-8-95

H Philippe, 2004, Phylogenomics of eukaryotes: impact of missing data on large alignments., Mol Biol Evol, 21, 1740, 10.1093/molbev/msh182

J. J Wiens, 2003, Missing data, incomplete taxa, and phylogenetic accuracy., Syst Biol, 52, 528, 10.1080/10635150390218330

J. J Wiens, 2008, Missing data and the accuracy of Bayesian phylogenetics., J Syst Evol, 46, 307

T. H Jukes, 1969, Evolution of protein molecules., 21

S Whelan, 2001, A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach., Mol Biol Evol, 18, 691, 10.1093/oxfordjournals.molbev.a003851

N Lartillot, 2004, A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process., Mol Biol Evol, 21, 1095, 10.1093/molbev/msh112

A Hejnol, 2009, Assessing the root of bilaterian animals with scalable phylogenomic methods., Proc Biol Sci, 276, 4261, 10.1098/rspb.2009.0896

H Philippe, 1994, Comparison of molecular and paleontological data in diatoms suggests a major gap in the fossil record., J Evol Biol, 7, 247, 10.1046/j.1420-9101.1994.7020247.x

K Meusemann, 2010, A phylogenomic approach to resolve the arthropod tree of life., Mol Biol Evol, 27, 2541, 10.1093/molbev/msq130

F Delsuc, 2005, Phylogenomics and the reconstruction of the tree of life., Nat Rev Genet, 6, 361, 10.1038/nrg1603

Z Yang, 1996, Maximum-likelihood models for combined analyses of multiple sequence data., J Mol Evol, 42, 587, 10.1007/BF02352289

A Kupczok, 2010, Accuracy of phylogeny reconstruction methods combining overlapping gene data sets., Algorithms Mol Biol, 5, 37, 10.1186/1748-7188-5-37

R. K Bradley, 2009, Fast statistical alignment., PLoS Comput Biol, 5, e1000392, 10.1371/journal.pcbi.1000392

J Castresana, 2000, Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis., Mol Biol Evol, 17, 540, 10.1093/oxfordjournals.molbev.a026334

B Roure, 2007, SCaFoS: a tool for selection, concatenation and fusion of sequences for phylogenomics., BMC Evol Biol, 7, S2, 10.1186/1471-2148-7-S1-S2

E Bapteste, 2002, The analysis of 100 genes supports the grouping of three highly divergent amoebae: <italic>Dictyostelium</italic>, <italic>Entamoeba</italic>, and <italic>Mastigamoeba</italic>., Proc Natl Acad Sci U S A, 99, 1414, 10.1073/pnas.032662799

D. M Robinson, 2003, Protein evolution with dependence among codons due to tertiary structure., Mol Biol Evol, 20, 1692, 10.1093/molbev/msg184

N Rodrigue, 2006, Assessing site-interdependent phylogenetic models of sequence evolution., Mol Biol Evol, 23, 1762, 10.1093/molbev/msl041

B. D Redelings, 2005, Joint Bayesian estimation of alignment and phylogeny., Syst Biol, 54, 401, 10.1080/10635150590947041

G Lunter, 2005, Bayesian coestimation of phylogeny and sequence alignment., BMC Bioinformatics, 6, 83, 10.1186/1471-2105-6-83

N Lartillot, 2006, Conjugate Gibbs sampling for Bayesian phylogenetic models., J Comput Biol, 13, 43, 10.1089/cmb.2006.13.1701

A Stamatakis, 2008, Efficient computation of the phylogenetic likelihood function on multi-gene alignments and multi-core architectures., Philos Trans R Soc Lond B Biol Sci, 363, 3977, 10.1098/rstb.2008.0163

A. P de Koning, 2010, Rapid likelihood analysis on large phylogenies using partial sampling of substitution histories., Mol Biol Evol, 27, 249, 10.1093/molbev/msp228

J Felsenstein, 1981, Evolutionary trees from DNA sequences: a maximum likelihood approach., J Mol Evol, 17, 368, 10.1007/BF01734359

C Lanave, 1984, A new method for calculating evolutionary substitution rates., J Mol Evol, 20, 86, 10.1007/BF02101990

M. O Dayhoff, 1972, A model of evolutionary change in proteins., 89

N Galtier, 1995, Inferring phylogenies from DNA sequences of unequal base compositions., Proc Natl Acad Sci U S A, 92, 11317, 10.1073/pnas.92.24.11317

Z Yang, 1995, On the use of nucleic acid sequences to infer early branchings in the tree of life., Mol Biol Evol, 12, 451

Z Yang, 1996, Among-site rate variation and its impact on phylogenetic analyses., Trends Ecol Evol, 11, 367, 10.1016/0169-5347(96)10041-0

B Kolaczkowski, 2008, A mixed branch length model of heterotachy improves phylogenetic accuracy., Mol Biol Evol, 25, 1054, 10.1093/molbev/msn042

N Rodriguez-Ezpeleta, 2007, Detecting and overcoming systematic errors in genome-scale phylogenies., Syst Biol, 56, 389, 10.1080/10635150701397643

H Nishihara, 2007, Rooting the eutherian tree: the power and pitfalls of phylogenomics., Genome Biol, 8, R199, 10.1186/gb-2007-8-9-r199

S Blanquart, 2008, A site- and time-heterogeneous model of amino acid replacement., Mol Biol Evol, 25, 842, 10.1093/molbev/msn018

C Than, 2009, Species tree inference by minimizing deep coalescences., PLoS Comput Biol, 5, e1000501, 10.1371/journal.pcbi.1000501

E Houliston, 2010, Clytia hemisphaerica: a jellyfish cousin joins the laboratory., Trends Genet, 26, 159, 10.1016/j.tig.2010.01.008

A Stamatakis, 2005, RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees., Bioinformatics, 21, 456, 10.1093/bioinformatics/bti191

N Lartillot, 2009, PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating., Bioinformatics, 25, 2286, 10.1093/bioinformatics/btp368