A review of long‐branch attraction
Abstract
The history of long‐branch attraction (LBA), and in particular the methods suggested to date to detect and avoid the artifact, is reviewed. Methods suggested to avoid LBA artifacts include excluding long‐branch taxa, excluding faster‐evolving third codon positions, using inference methods less sensitive to LBA such as likelihood, the Aguinaldo et al. approach, sampling more taxa to break up long branches, and sampling more characters, especially of another kind; the pros and cons of these are discussed. Methods suggested to detect LBA are numerous and include methodological disconcordance, RASA, separate partition analyses, parametric simulation, random outgroup sequences, long‐branch extraction, split decomposition and spectral analysis. Less than 10 years ago it was doubted whether LBA occurred in real datasets. Today, examples are numerous in the literature and it is argued that the development of methods to deal with the problem is warranted. A 16 kbp dataset of placental mammals and a combined morphological and molecular dataset of gall wasps are used to illustrate the particularly common problem of LBA of problematic ingroup taxa to outgroups. The preferred methods of separate partition analysis, methodological disconcordance, and long‐branch extraction are used to demonstrate detection. It is argued that since outgroup taxa almost always represent long branches, and as such are a hazard for misplacing long‐branched ingroup taxa, phylogenetic analyses should always be run both with and without the outgroups included. This will detect whether the outgroup only roots the ingroup or whether it simultaneously alters the ingroup topology; previous studies have shown that such alteration is most often for the worse. Apart from LBA to outgroups being the major and most common problem, scanning the literature also revealed the ill‐advised comfort taken, in the age of genomics, from high support values based on thousands of characters but very few taxa.
Taxon sampling is crucial for an accurate phylogenetic estimate, and trust cannot be placed in whole mitochondrial or chloroplast genome studies with only a few taxa, despite their high support values. The placental mammal example demonstrates that parsimony analysis is prone to LBA through the attraction of the tenrec to the distant marsupial outgroups. In addition, the murid rodents, which gave rise to the classic “the guinea‐pig is not a rodent” hypothesis in 1996, are also shown to be attracted to the outgroup by nuclear genes, although including the morphological evidence for rodents and Glires overcomes the artifact. The gall wasp example illustrates that Bayesian analyses with a partition‐specific GTR + Γ + I model give a conflicting resolution of clades, with a posterior probability of 1.0, when comparing ingroup‐alone versus outgroup‐rooted topologies, and that this is due to long‐branch attraction to the outgroup.
© The Willi Hennig Society 2005.
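
The recommendation to run analyses with and without the outgroup can be illustrated with a minimal sketch. It is not the paper's own procedure or software: it assumes the two resulting trees are already available as nested Python tuples of taxon names, and the helper names (`leaf_set`, `prune`, `splits`, `outgroup_alters_ingroup`) and the taxa `A` to `D` and `OUT` are hypothetical. The sketch prunes the outgroup from the rooted tree and checks whether the remaining unrooted ingroup splits match those of the ingroup-only tree.

```python
def leaf_set(tree):
    """Return the frozenset of taxon names below a nested-tuple tree node."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def prune(tree, keep):
    """Remove taxa not in `keep`, collapsing nodes left with a single child."""
    if isinstance(tree, str):
        return tree if tree in keep else None
    children = [c for c in (prune(child, keep) for child in tree) if c is not None]
    if not children:
        return None
    if len(children) == 1:
        return children[0]
    return tuple(children)


def splits(tree):
    """Return the non-trivial unrooted bipartitions implied by a tree."""
    taxa = leaf_set(tree)
    found = set()

    def visit(node):
        if isinstance(node, str):
            return
        side = leaf_set(node)
        other = taxa - side
        # Keep only splits with at least two taxa on each side.
        if len(side) > 1 and len(other) > 1:
            found.add(frozenset([side, other]))
        for child in node:
            visit(child)

    visit(tree)
    return found


def outgroup_alters_ingroup(ingroup_only_tree, rooted_tree, ingroup):
    """True if including the outgroup changed the unrooted ingroup topology."""
    pruned = prune(rooted_tree, frozenset(ingroup))
    return splits(ingroup_only_tree) != splits(pruned)


# Hypothetical example trees (assumed, not taken from the paper's datasets).
ingroup = ["A", "B", "C", "D"]
ingroup_only = (("A", "B"), ("C", "D"))
rooting_only = ((("A", "B"), "C"), ("D", "OUT"))   # outgroup merely roots
topology_changed = (("A", "C"), ("B", ("D", "OUT")))  # ingroup splits differ

print(outgroup_alters_ingroup(ingroup_only, rooting_only, ingroup))      # False
print(outgroup_alters_ingroup(ingroup_only, topology_changed, ingroup))  # True
```

A `True` result only flags that the ingroup topology differs between the two analyses; deciding whether long‐branch attraction is the cause still requires the detection methods discussed in the abstract.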