Estimation of individual admixture: Analytical and study design considerations
Tóm tắt
The genome of an admixed individual represents a mixture of alleles from different ancestries. In the United States, the two largest minority groups, African‐Americans and Hispanics, are both admixed. An understanding of the admixture proportion at an individual level (individual admixture, or IA) is valuable for both population geneticists and epidemiologists who conduct case‐control association studies in these groups. Here we present an extension of a previously described frequentist (maximum likelihood or ML) approach to estimate individual admixture that allows for uncertainty in ancestral allele frequencies. We compare this approach both to prior partial likelihood based methods as well as more recently described Bayesian MCMC methods. Our full ML method demonstrates increased robustness when compared to an existing partial ML approach. Simulations also suggest that this frequentist estimator achieves similar efficiency, measured by the mean squared error criterion, as Bayesian methods but requires just a fraction of the computational time to produce point estimates, allowing for extensive analysis (e.g., simulations) not possible by Bayesian methods. Our simulation results demonstrate that inclusion of ancestral populations or their surrogates in the analysis is required by any method of IA estimation to obtain reasonable results. Genet. Epidemiol. © 2005 Wiley‐Liss, Inc.
Từ khóa
Tài liệu tham khảo
Dempster AP, 1977, Maximum likelihood from incomplete data via the EM algorithm, JRSS B, 39, 1
Shriver MD, 1997, Ethnic‐affiliation estimation by use of population‐specific DNA markers, Am J Hum Genet, 60, 957
Wang J, 2003, Maximum‐likelihood estimation of admixture proportions from genetic data, Genetics, 164, 747, 10.1093/genetics/164.2.747