Estimation of individual admixture: Analytical and study design considerations

Genetic Epidemiology - Tập 28 Số 4 - Trang 289-301 - 2005
Hua Tang1, Jie Peng2, Pei Wang2, Neil Risch3,4
1Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington
2Department of Statistics, Stanford University, Stanford, California
3Department of Genetics, Stanford University, Stanford, California
4Division of Research, Kaiser Permanente, Oakland, California

Tóm tắt

Abstract

The genome of an admixed individual represents a mixture of alleles from different ancestries. In the United States, the two largest minority groups, African‐Americans and Hispanics, are both admixed. An understanding of the admixture proportion at an individual level (individual admixture, or IA) is valuable for both population geneticists and epidemiologists who conduct case‐control association studies in these groups. Here we present an extension of a previously described frequentist (maximum likelihood or ML) approach to estimate individual admixture that allows for uncertainty in ancestral allele frequencies. We compare this approach both to prior partial likelihood based methods as well as more recently described Bayesian MCMC methods. Our full ML method demonstrates increased robustness when compared to an existing partial ML approach. Simulations also suggest that this frequentist estimator achieves similar efficiency, measured by the mean squared error criterion, as Bayesian methods but requires just a fraction of the computational time to produce point estimates, allowing for extensive analysis (e.g., simulations) not possible by Bayesian methods. Our simulation results demonstrate that inclusion of ancestral populations or their surrogates in the analysis is required by any method of IA estimation to obtain reasonable results. Genet. Epidemiol. © 2005 Wiley‐Liss, Inc.

Từ khóa


Tài liệu tham khảo

10.1007/s00439-004-1125-7

10.1126/science.296.5566.261b

10.1093/genetics/158.3.1347

10.1086/377138

Dempster AP, 1977, Maximum likelihood from incomplete data via the EM algorithm, JRSS B, 39, 1

10.1007/978-1-4899-4541-9

10.1111/j.1365-294X.2005.02553.x

10.1002/ajpa.1330700404

10.2337/diacare.14.7.618

10.1007/978-0-387-21606-5

10.1086/375613

10.1093/bioinformatics/18.2.337

10.1038/nbt869

10.1016/0304-4149(82)90011-4

10.1093/genetics/127.2.417

10.1086/377137

10.1111/j.1365-294X.2006.02908.x

10.1002/gepi.10319

10.1086/420871

10.1111/j.1365-294X.2004.02396.x

10.1093/bioinformatics/13.3.235

10.1126/science.1078311

10.1086/380416

Shriver MD, 1997, Ethnic‐affiliation estimation by use of population‐specific DNA markers, Am J Hum Genet, 60, 957

10.1086/420856

Wang J, 2003, Maximum‐likelihood estimation of admixture proportions from genetic data, Genetics, 164, 747, 10.1093/genetics/164.2.747

10.1016/S0065-2660(01)42016-5

10.1086/421329

10.1517/phgs.4.4.431.22758