Identifying adaptive genetic divergence among populations from genome scans

Molecular Ecology - Tập 13 Số 4 - Trang 969-980 - 2004
Mark Beaumont1, David J. Balding
1School of Animal and Microbial Sciences, The University of Reading, Whiteknights, PO Box 228, Reading RG6 6AJ, UK. [email protected]

Tóm tắt

AbstractThe identification of signatures of natural selection in genomic surveys has become an area of intense research, stimulated by the increasing ease with which genetic markers can be typed. Loci identified as subject to selection may be functionally important, and hence (weak) candidates for involvement in disease causation. They can also be useful in determining the adaptive differentiation of populations, and exploring hypotheses about speciation. Adaptive differentiation has traditionally been identified from differences in allele frequencies among different populations, summarised by an estimate of FST. Low outliers relative to an appropriate neutral population‐genetics model indicate loci subject to balancing selection, whereas high outliers suggest adaptive (directional) selection. However, the problem of identifying statistically significant departures from neutrality is complicated by confounding effects on the distribution of FST estimates, and current methods have not yet been tested in large‐scale simulation experiments. Here, we simulate data from a structured population at many unlinked, diallelic loci that are predominantly neutral but with some loci subject to adaptive or balancing selection. We develop a hierarchical‐Bayesian method, implemented via Markov chain Monte Carlo (MCMC), and assess its performance in distinguishing the loci simulated under selection from the neutral loci. We also compare this performance with that of a frequentist method, based on moment‐based estimates of FST. We find that both methods can identify loci subject to adaptive selection when the selection coefficient is at least five times the migration rate. Neither method could reliably distinguish loci under balancing selection in our simulations, even when the selection coefficient is twenty times the migration rate.

Từ khóa


Tài liệu tham khảo

10.1101/gr.631202

10.1016/S0040-5809(03)00007-8

10.1007/BF01441146

10.1007/BF02432124

10.1098/rspb.1996.0237

10.1146/annurev.ento.46.1.441

10.1073/pnas.88.3.839

10.1006/tpbi.1995.1025

10.1146/annurev.ge.29.120195.002153

10.1007/978-1-4899-4485-6

Kayser M, 2003, A genome scan to detect candidate regions influenced by local natural selection in human populations, Molecular Biology and Evolution, 20, 893, 10.1093/molbev/msg092

10.1111/j.1365-294X.2006.03195.x

10.1111/j.1461-0248.2007.01028.x

10.1046/j.1365-294X.2003.01783.x

10.1017/S0016672300033607

Singh RS, 1987, A comprehensive study of genic variation in natural populations of Drosophila melanogaster. II. Estimates of heterozygosity and patterns of geographic variation, Genetics, 117, 255, 10.1093/genetics/117.2.255

Schlotterer C, 2002, A microsatellite‐based multilocus screen for the identification of local selective sweeps, Genetics, 160, 753, 10.1093/genetics/160.2.753

SorensenD GianolaD(2002)Likelihood Bayesian and MCMC methods in quantitative genetics.Springer New York.

StorzJF NachmanMW(2003)Natural selection on protein polymorphism in the rodent genus Peromyscus: evidence from interlocus contrasts.Evolution57 2628–2635.

10.1111/j.0014-3820.2002.tb00857.x

10.1111/j.1365-294X.2006.03099.x

10.1111/j.1558-5646.1984.tb05657.x

10.1046/j.1420-9101.2001.00304.x

10.1093/genetics/153.4.1863

10.1093/genetics/159.2.893

10.1111/j.0014-3820.2005.tb00977.x

Wright S, 1969, Evolution and the Genetics of Populations. Volume 2: The Theory of Gene Frequencies

10.1046/j.1420-9101.2001.00335.x