Inferring the Joint Demographic History of Multiple Populations from Multidimensional SNP Frequency Data

PLoS Genetics - Tập 5 Số 10 - Trang e1000695
Ryan N. Gutenkunst1, Ryan D. Hernandez2, Scott Williamson3, Carlos D. Bustamante3
1Theoretical Biology and Biophysics and Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico, USA.
2Human Genetics, University of Chicago, Chicago, Illinois, United States of America
3Biological Statistics and Computational Biology, Cornell University, #N##TAB##TAB##TAB#Ithaca, New York, United States of America

Tóm tắt

Từ khóa


Tài liệu tham khảo

P Mellars, 2006, Going east: new genetic and archaeological perspectives on the modern human colonization of Eurasia., Science, 313, 796, 10.1126/science.1128402

T Goebel, 2008, The late Pleistocene dispersal of modern humans in the Americas., Science, 319, 1497, 10.1126/science.1153569

R Nielsen, 2007, Recent and ongoing selection in the human genome., Nat Rev Genet, 8, 857, 10.1038/nrg2187

AM Adams, 2004, Maximum-likelihood estimation of demographic parameters using the frequency spectrum of unlinked single-nucleotide polymorphisms., Genetics, 168, 1699, 10.1534/genetics.104.030171

GT Marth, 2004, The allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations., Genetics, 166, 351, 10.1534/genetics.166.1.351

BF Voight, 2005, Interrogating multiple aspects of variation in a full resequencing data set to infer human population size changes., Proc Natl Acad Sci USA, 102, 18508, 10.1073/pnas.0507325102

J Hey, 2005, On the number of New World founders: a population genetic portrait of the peopling of the Americas., PLoS Biol, 3, e193, 10.1371/journal.pbio.0030193

SF Schaffner, 2005, Calibrating a coalescent simulation of human genome sequence variation., Genome Res, 15, 1576, 10.1101/gr.3709305

C Becquet, 2007, A new approach to estimate parameters of speciation models with application to apes., Genome Res, 17, 1505, 10.1101/gr.6409707

AL Caicedo, 2007, Genome-wide patterns of nucleotide polymorphism in domesticated rice., PLoS Genet, 3, 1745, 10.1371/journal.pgen.0030163

A Keinan, 2007, Measurement of the human allele frequency spectrum demonstrates greater genetic drift in East Asians than in Europeans., Nat Genet, 39, 1251, 10.1038/ng2116

D Garrigan, 2007, Inferring human population sizes, divergence times and rates of gene flow from mitochondrial, X and Y chromosome resequencing data., Genetics, 177, 2195, 10.1534/genetics.107.077495

CJ Mulligan, 2008, Updated three-stage model for the peopling of the Americas., PLoS ONE, 3, e3199, 10.1371/journal.pone.0003199

A Kitchen, 2008, A three-stage colonization model for the peopling of the Americas., PLoS ONE, 3, e1596, 10.1371/journal.pone.0001596

M Cox, 2008, Intergenic DNA sequences from the human X chromosome reveal high rates of global gene flow., BMC Genetics, 9, 76, 10.1186/1471-2156-9-76

AJ Drummond, 2005, Bayesian coalescent inference of past population dynamics from molecular sequences., Mol Biol Evol, 22, 1185, 10.1093/molbev/msi103

RD Hernandez, 2007, Context dependence, ancestral misidentification, and spurious signatures of natural selection., Mol Biol Evol, 24, 1792, 10.1093/molbev/msm108

R Nielsen, 2009, Darwinian and demographic forces affecting human protein coding genes., 10.1101/gr.088336.108

J Hey, 2004, Multilocus methods for estimating population sizes, migration rates and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis., Genetics, 167, 747, 10.1534/genetics.103.024182

SA Sawyer, 1992, Population genetics of polymorphism and divergence., Genetics, 132, 1161, 10.1093/genetics/132.4.1161

CD Bustamante, 2001, Directional selection and the site-frequency spectrum., Genetics, 159, 1779, 10.1093/genetics/159.4.1779

J Wakeley, 2008, Coalescent Theory: an Introduction

SH Williamson, 2005, Simultaneous inference of selection and population growth from patterns of variation in the human genome., Proc Natl Acad Sci USA, 102, 7882, 10.1073/pnas.0502300102

RD Hernandez, 2007, Demographic histories and patterns of linkage disequilibrium in Chinese and Indian rhesus macaques., Science, 316, 240, 10.1126/science.1140462

C Wiuf, 2006, Consistency of estimators of population scaled parameters using composite likelihood., J Math Biol, 53, 821, 10.1007/s00285-006-0031-0

L Zhu, 2005, A composite-likelihood approach for detecting directional selection from DNA sequence data., Genetics, 170, 1411, 10.1534/genetics.104.035097

RJ Livingston, 2004, Pattern of sequence variation across 213 environmental response genes., Genome Res, 14, 1821, 10.1101/gr.2730004

RA Fischer, 1922, On the dominance ratio., Proc Roy Soc Edin, 55, 399

M Kimura, 1964, Diffusion models in population genetics., J Appl Probab, 1, 177, 10.1017/S0021900200108368

WJ Ewens, 2000, Mathematical Population Genetics: I. Theoretical Introduction

GA Watterson, 1975, On the number of segregating sites in genetical models without recombination., Theor Popul Biol, 7, 256, 10.1016/0040-5809(75)90020-9

T Nagylaki, 1980, The strong-migration limit in geographically structured populations., J Math Biol, 9, 101, 10.1007/BF00275916

AG Clark, 2005, Ascertainment bias in studies of human genome-wide polymorphism., Genome Res, 15, 1496, 10.1101/gr.4107905

R Nielsen, 2004, Reconstituting the frequency spectrum of ascertained single-nucleotide polymorphism data., Genetics, 168, 2373, 10.1534/genetics.104.031039

RR Hudson, 2002, Generating samples under a Wright-Fisher neutral model of genetic variation., Bioinformatics, 18, 337, 10.1093/bioinformatics/18.2.337

WH Press, 2007, Numerical Recipes: The Art of Scientific Computing

JS Chang, 1970, A practical difference scheme for Fokker-Planck equations., J Comput Phys, 6, 1, 10.1016/0021-9991(70)90001-X

TE Oliphant, 2006, Guide to NumPy

TE Oliphant, 2007, Python for scientific computing., Comput Sci Eng, 9, 10, 10.1109/MCSE.2007.58

JD Hunter, 2007, Matplotlib: a 2D graphics environment., Comput Sci Eng, 9, 90, 10.1109/MCSE.2007.55

NIEHS Environmental Genome Project.

JM Akey, 2004, Population history and natural selection shape patterns of genetic variation in 132 genes., PLoS Biol, 2, e286, 10.1371/journal.pbio.0020286

2005, Initial sequence of the chimpanzee genome and comparison with the human genome., Nature, 437, 69, 10.1038/nature04072

DG Hwang, 2004, Bayesian Markov chain Monte Carlo sequence analysis reveals varying neutral substitution patterns in mammalian evolution., Proc Natl Acad Sci USA, 101, 13994, 10.1073/pnas.0404142101

S Kumar, 2005, Placing confidence limits on the molecular age of the human-chimpanzee divergence., Proc Natl Acad Sci USA, 102, 18842, 10.1073/pnas.0509585102

AS Kondrashov, 2002, Direct estimates of human per nucleotide mutation rates at 20 loci causing Mendelian diseases., Hum Mutat, 21, 12, 10.1002/humu.10147

JN Fenner, 2005, Cross-cultural estimation of the human generation interval for use in genetics-based population divergence studies., Am J Phys Anthropol, 128, 415, 10.1002/ajpa.20188

M Tremblay, 2000, New estimates of intergenerational time intervals for the calculation of age and origin of mutations., Am J Hum Genet, 66, 651, 10.1086/302770

AR Boyko, 2008, Assessing the evolutionary impact of amino acid mutations in the human genome., PLoS Genet, 4, e1000083, 10.1371/journal.pgen.1000083

JG Heinrich, 2001, Can the likelihood-function value be used to measure goodness of fit?

AL Price, 2007, A genomewide admixture map for Latino populations., Am J Hum Genet, 80, 1024, 10.1086/518313

JK Pritchard, 2000, Inference of population structure using multilocus genotype data., Genetics, 155, 945, 10.1093/genetics/155.2.945

N Patterson, 2004, Methods for high-density admixture mapping of disease genes., Am J Hum Genet, 74, 979, 10.1086/420871

GV Kryukov, 2009, Power of deep, all-exon resequencing for discovery of human trait genes., Proc Natl Acad Sci USA, 106, 3871, 10.1073/pnas.0812824106

ME Weale, 2002, Y chromosome evidence for Anglo-Saxon mass migration., Mol Biol Evol, 19, 1008, 10.1093/oxfordjournals.molbev.a004160

JZ Li, 2008, Worldwide human relationships inferred from genome-wide patterns of variation., Science, 319, 1100, 10.1126/science.1153717

M Jakobsson, 2008, Genotype, haplotype and copy-number variation in worldwide human populations., Nature, 451, 998, 10.1038/nature06742

JD Wall, 2008, A novel DNA sequence database for analyzing human demographic history., Genome Res, 18, 1354, 10.1101/gr.075630.107

JM Braverman, 1995, The hitchhiking effect on the site frequency spectrum of DNA polymorphisms., Genetics, 140, 783, 10.1093/genetics/140.2.783

S Myers, 2008, Can one learn history from the allelic spectrum?, Theor Popul Biol, 73, 342, 10.1016/j.tpb.2008.01.001

DA Pierce, 1986, Residuals in generalized linear models., J Am Stat Assoc, 81, 977, 10.1080/01621459.1986.10478361