Distinguishing between hot-spots and melting-pots of genetic diversity using haplotype connectivity

Binh Thanh Nguyen1, Andreas Spillner2, Brent C. Emerson3, Vincent Moulton1
1School of Computing Sciences, University of East Anglia, Norwich NR4 7TJ, UK
2Department of Mathematics and Computer Science, University of Greifswald, 17489, Greifswald, Germany
3School of Biological Sciences, University of East Anglia, Norwich NR4 7TJ, UK

Tóm tắt

Abstract We introduce a method to help identify how the genetic diversity of a species within a geographic region might have arisen. This problem appears, for example, in the context of identifying refugia in phylogeography, and in the conservation of biodiversity where it is a factor in nature reserve selection. Complementing current methods for measuring genetic diversity, we analyze pairwise distances between the haplotypes of a species found in a geographic region and derive a quantity, called haplotype connectivity, that aims to capture how divergent the haplotypes are relative to one another. We propose using haplotype connectivity to indicate whether, for geographic regions that harbor a highly diverse collection of haplotypes, diversity evolved inside a region over a long period of time (a "hot-spot") or is the result of a more recent mixture (a "melting-pot"). We describe how the haplotype connectivity for a collection of haplotypes can be computed efficiently and briefly discuss some related optimization problems that arise in this context. We illustrate the applicability of our method using two previously published data sets of a species of beetle from the genus Brachyderes and a species of tree from the genus Pinus.

Từ khóa


Tài liệu tham khảo

Emerson BC, Hewitt GM: Phylogeography. Curr Biol. 2005, 15: R367-R371. 10.1016/j.cub.2005.05.016

Hewitt GM: Some genetic consequences of ice ages, and their role in divergence and speciation. Biol J Linn Soc. 1996, 58: 247-276.

Hewitt GM: The genetic legacy of the Quarternary ice ages. Nature. 2000, 405: 907-913. 10.1038/35016000

Petit R, Aguinagalde I, Beaulieu J, Bittkau C, Brewer S, Cheddadi R, Ennos R, Fineschi S, Grivet D, Lascoux M, Mohanty A, Müller-Stark G, Demesure-Musch B, Palmée A, Martíin J, Rendell S, Vendramin G: Glacial refugia: hotspots but not melting pots of genetic diversity. Science. 2003, 300: 1563-1565. 10.1126/science.1083264

Desmet PG, Cowling RM, Ellis AG, Pressey RL: Integrating biosystematic data into conservation planning: perspectives from Southern Africa's succulent Karoo. Syst Biol. 2002, 51: 317-330. 10.1080/10635150252899798

Avise JC, Arnold J, Ball RM, Bermingham E, Lamb T, Neigel JE, Reeb CA, Saunders NC: Intraspecific phylogeography: the mitochondrial DNA bridge between population genetics and systematics. Ann Rev Ecol Syst. 1987, 18: 489-522.

Camerini P: The min-max spanning tree problem and some extensions. Inform Process Lett. 1978, 7: 10-14. 10.1016/0020-0190(78)90030-3

Templeton AR: Using phylogeographic analyses of gene trees to test species status and processes. Mol Ecol. 2001, 10: 779-791. 10.1046/j.1365-294x.2001.01199.x

Posada D, Crandall KA, Templeton AR: Nested clade analysis statistics. Mol Ecol Notes. 2006, 6: 590-593. 10.1111/j.1471-8286.2006.01368.x

Templeton AR: Nested clade analyses of phylogeographic data: testing hypothesis about gene flow and population history. Mol Ecol. 1998, 7: 381-397. 10.1046/j.1365-294x.1998.00308.x

Lacey Knowles L: Why does a method that fails continue to be used?. Evolution. 2008, 62: 2713-2717. 10.1111/j.1558-5646.2008.00481.x

Rozenfeld AF, Arnaud-Haond S, Hernández-García E, Eguíluz VM, Matías MA, Serrão E, Duarte CM: Spectrum of genetic diversity and networks of clonal organisms. J R Soc Interface. 2007, 4: 1093-1102. 10.1098/rsif.2007.0230

Excoffier L: Patterns of DNA sequence diversity and genetic structure after a range expansion: lessons from the infinite-island model. Mol Ecol. 2004, 13: 853-864. 10.1046/j.1365-294X.2003.02004.x

Schneider S, Excoffier L: Estimation of past demographic parameters from the distribution of pairwise differences when the mutation rates vary among sites: application to human mitochondrial DNA. Genetics. 1999, 152: 1079-1089.

Nei M, Tajima F: DNA polymorphism detectable by restriction endonucleases. Genetics. 1981, 97: 145-163.

Tamura K, Nei M: Estimation of the number of nucleotide substitutions in the control region of the mitochondrial DNA in humans and chimpanzees. Mol Biol Evol. 1993, 10: 512-526.

Vendramin GG, Anzidei M, Madaghiele A, Bucci G: Distribution of genetic diversity in Pinus pinaster Ait. as revealed by chloroplast microsatellites. Theor Appl Genet. 1998, 97: 456-463. 10.1007/s001220050917

Huson D, Nettles S, Warnow T: Disk-Covering, a Fast-Converging Method for Phylogenetic Tree Reconstruction. J Comput Biol. 1999, 6: 369-386. 10.1089/106652799318337

Berry A, Sigayret A, Sinoquet C: Maximal sub-triangulation in pre-processing phylogenetic data. Soft Computing - A Fusion of Foundations, Methodologies and Applications. 2006, 10: 461-468.

Cormen H, Leiserson CE, Rivest RL, Stein C: Introduction to Algorithms. 2001, The MIT Press

Schönhage A, Paterson M, Pippenger N: Finding the median. J Comput Syst Sci. 1976, 13: 184-199.

Punnen A, Chapovska O: The bottleneck k-MST. Inform Process Lett. 2005, 95: 512-517. 10.1016/j.ipl.2005.05.025

Even S, Tarjan R: Network flow and testing graph connectivity. SIAM J Comput. 1975, 4: 507-518. 10.1137/0204043

Gabow H: Using expander graphs to find vertex connectivity. J ACM. 2006, 53: 800-844. 10.1145/1183907.1183912

Tajima F: The amount of DNA polymorphism maintained in a finite population when neutral mutation rates varies among sites. Genetics. 1996, 143: 1761-1770.

Rauch EM, Bar-Yam Y: Theory predicts the uneven distribution of genetic diversity within species. Nature. 2004, 431: 449-452. 10.1038/nature02745

Rauch EM, Bar-Yam Y: Estimating the total genetic diversity of a spatial field population from a sample and implications of its dependence on habitat area. Proc Natl Acad Sci Unit States Am. 2005, 102: 9826-9829. 10.1073/pnas.0408471102

Faith D: Conservation evaluation and phylogenetic diversity. Biol Conservat. 1992, 61: 1-10. 10.1016/0006-3207(92)91201-3

Pardi F, Goldman N: Species choice for comparative genomics: being greedy works. PLoS Genetics. 2005, 1 (6):

Steel MA: Phylogenetic diversity and the greedy algorithm. Syst Biol. 2005, 54 (4): 527-529. 10.1080/10635150590947023

Spillner A, Nguyen BT, Moulton V: Computing Phylogenetic Diversity for Split Systems. IEEE ACM Trans Comput Biol Bioinformatics. 2008, 5 (2): 235-244. 10.1109/TCBB.2007.70260

Blum A, Chalasani P, Coppersmith D, Pulleyblank WR, Raghavan P, Sudan M: The minimum latency problem. Proc. ACM Symposium on Theory of Computing (STOC). 1994, 163-171.

Minh B, Klaere S, von Haeseler A: Taxon selection under split diversity. Syst Biol. 2009, 58: 586-594. 10.1093/sysbio/syp058

Echt CS, Verno LLD, Anzidei M, Vendramin GG: Chloroplast microsatellites reveal population genetic diversity in red pine, Pinus resinosa Ait. Mol Ecol. 1998, 7: 307-317. 10.1046/j.1365-294X.1998.00350.x

Hansen P, Moon I: Dispersing facilities on a network. TIMS/ORSA Joint National Meeting, Washington, D.C. 1988

Pisinger D: Upper bounds and exact algorithms for p-dispersion problems. Comput Oper Res. 2006, 33: 1380-1398. 10.1016/j.cor.2004.09.033

Emerson B, Forgie S, Goodacre S, Oromi P: Testing phylogeographic predictions on an active volcanic island: Brachyderes rugatus (Coleoptera: Curculionidae) on La Palma (Canary Islands). Mol Ecol. 2006, 15: 449-458. 10.1111/j.1365-294X.2005.02786.x

Clement M, Posada D, Crandall KA: TCS: A computer program to estimate gene genealogies. Mol Ecol. 2000, 9: 1557-1659. 10.1046/j.1365-294x.2000.01020.x

Bucci G, González-Martínez S, le Provost G, Plomion C, Ribeiro M, Sebastiani F, Alía R, Vendramin G: Range-wide phylogeography and gene zones in Pinus pinaster Ait. revealed by chloroplast microsatellite markers. Mol Ecol. 2007, 16: 2137-2153. 10.1111/j.1365-294X.2007.03275.x

Schweiger O, Klotz S, Durka W, Kühn I: A comparative test of phylogenetic diversity indices. Oecologia. 2008, 157: 485-495. 10.1007/s00442-008-1082-2

Regan H, Hierl L, Franklin J, Deutschman D, Schmalbach H, Winchell C, Johnson B: Species prioritization for monitoring and management in regional multiple species conservation plans. Diversity and Distributions. 2007, 14: 462-471. 10.1111/j.1472-4642.2007.00447.x

Hartmann K, Steel M: Phylogenetic diversity: from combinatorics to ecology. Reconstructing evolution: new mathematical and computational approaches. Edited by: Gascuel O, Steel M. 2007, Oxford University Press

Weitzman M: The Noah's Ark Problem. Econometrica. 1998, 66: 1279-1298. 10.2307/2999617