An evaluation of the performance of HapMap SNP data in a Shanghai Chinese population: Analyses of allele frequency, linkage disequilibrium pattern and tagging SNPs transferability on chromosome 1q21-q25

Cheng Hu1, Jia Wang1, Weihua Zhang2,3, Congrong Wang1, Rong Zhang1, Jie Wang1, Xiaojing Ma1, Kun–san Xiang1
1Shanghai Diabetes Institute, Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China.
2Department of Cardiology, Ealing Hospital NHS Trust, Southall, Middlesex, UK
3Section of Cancer Genetics, The Institute of Cancer Research, Belmont, Sutton, Surrey, UK

Tóm tắt

Abstract Background The HapMap project aimed to catalog millions of common single nucleotide polymorphisms (SNPs) in the human genome in four major populations, in order to facilitate association studies of complex diseases. To examine the transferability of Han Chinese in Beijing HapMap data to the Southern Han Chinese in Shanghai, we performed comparative analyses between genotypes from over 4,500 SNPs in a 21 Mb region on chromosome 1q21-q25 in 80 unrelated Shanghai Chinese and 45 HapMap Chinese data. Results Three thousand and forty-two SNPs were analyzed after removal of SNPs that failed quality control and those not in the HapMap panel. We compared the allele frequency distributions, linkage disequilibrium patterns, haplotype frequency distributions and tagging SNP sets transferability between the HapMap population and Shanghai Chinese population. Among the four HapMap populations, Beijing Chinese showed the best correlation with Shanghai population on allele frequencies, linkage disequilibrium and haplotype frequencies. Tagging SNP sets selected from four HapMap populations at different thresholds were evaluated in the Shanghai sample. Under the threshold of r2 equal to 0.8 or 0.5, both HapMap Chinese and Japanese data showed better coverage and tagging efficiency than Caucasian and African data. Conclusion Our study supported the applicability of HapMap Beijing Chinese SNP data to the study of complex diseases among southern Chinese population.

Từ khóa


Tài liệu tham khảo

The International HapMap Consortium: The International HapMap Project. Nature. 2003, 426: 789-796. 10.1038/nature02168.

The International HapMap Consortium: Integrating ethics and science in the International HapMap Project. Nat Rev Genet. 2004, 5: 467-475. 10.1038/nrg1351.

The International HapMap Consortium: A haplotype map of the human genome. Nature. 2005, 437: 1299-1320. 10.1038/nature04226.

Lim J, Kim YJ, Yoon Y, Kim SO, Kang H, Park J, et al: Comparative study of the linkage disequilibrium of an ENCODE region, chromosome 7p15, in Korean, Japanese, and Han Chinese samples. Genomics. 2006, 87: 392-398. 10.1016/j.ygeno.2005.11.002.

Mahasirimongkol S, Chantratita W, Promso S, Pasomsab E, Jinawath N, Jongjaroenprasert W, et al: Similarity of the allele frequency and linkage disequilibrium pattern of single nucleotide polymorphisms in drug-related gene loci between Thai and northern East Asian populations: implications for tagging SNP selection in Thais. J Hum Genet. 2006, 51: 896-904. 10.1007/s10038-006-0041-1.

Montpetit A, Nelis M, Laflamme P, Magi R, Ke X, Remm M, et al: An evaluation of the performance of tag SNPs derived from HapMap in a Caucasian population. PLoS Genet. 2006, 2: e27-10.1371/journal.pgen.0020027.

Mueller JC, Lohmussaar E, Magi R, Remm M, Bettecken T, Lichtner P, et al: Linkage disequilibrium patterns and tagSNP transferability among European populations. Am J Hum Genet. 2005, 76: 387-398. 10.1086/427925.

Ribas G, Gonzalez-Neira A, Salas A, Milne RL, Vega A, Carracedo B, et al: Evaluating HapMap SNP data transferability in a large-scale genotyping project involving 175 cancer-associated genes. Hum Genet. 2006, 118: 669-679. 10.1007/s00439-005-0094-9.

Service S, Sabatti C, Freimer N: Tag SNPs chosen from HapMap perform well in several population isolates. Genet Epidemiol. 2007, 31: 189-194. 10.1002/gepi.20201.

Smith EM, Wang X, Littrell J, Eckert J, Cole R, Kissebah AH, et al: Comparison of linkage disequilibrium patterns between the HapMap CEPH samples and a family-based cohort of Northern European descent. Genomics. 2006, 88: 407-414. 10.1016/j.ygeno.2006.04.004.

Stankovich J, Cox CJ, Tan RB, Montgomery DS, Huxtable SJ, Rubio JP, et al: On the utility of data from the International HapMap Project for Australian association studies. Hum Genet. 2006, 119: 220-222. 10.1007/s00439-005-0120-y.

Willer CJ, Scott LJ, Bonnycastle LL, Jackson AU, Chines P, Pruim R, et al: Tag SNP selection for Finnish individuals based on the CEPH Utah HapMap database. Genet Epidemiol. 2006, 30: 180-190. 10.1002/gepi.20131.

Xiao C, Cavalli-Sforza LL, Minch E, Du R: Principal component analysis of gene frequencies of Chinese populations. Sci China C Life Sci. 2000, 43: 472-481.

Chu JY, Huang W, Kuang SQ, Wang JM, Xu JJ, Chu ZT, et al: Genetic relationship of populations in China. Proc Natl Acad Sci USA. 1998, 95: 11763-11768. 10.1073/pnas.95.20.11763.

Wen B, Li H, Lu D, Song X, Zhang F, He Y, et al: Genetic evidence supports demic diffusion of Han culture. Nature. 2004, 431: 302-305. 10.1038/nature02878.

Zhao TM, Lee TD: Gm and Km allotypes in 74 Chinese populations: a hypothesis of the origin of the Chinese nation. Hum Genet. 1989, 83: 101-110. 10.1007/BF00286699.

Gu S, Pakstis AJ, Li H, Speed WC, Kidd JR, Kidd KK: Significant variation in haplotype block structure but conservation in tagSNP patterns among global populations. Eur J Hum Genet. 2007, 15: 302-312. 10.1038/sj.ejhg.5201751.

Huang W, He Y, Wang H, Wang Y, Liu Y, Wang Y, et al: Linkage disequilibrium sharing and haplotype-tagged SNP portability between populations. Proc Natl Acad Sci USA. 2006, 103: 1418-1421. 10.1073/pnas.0510360103.

Cardon LR, Bell JI: Association study designs for complex diseases. Nat Rev Genet. 2001, 2: 91-99. 10.1038/35052543.

Gordon D, Finch SJ: Factors affecting statistical power in the detection of genetic association. J Clin Invest. 2005, 115: 1408-1418. 10.1172/JCI24756.

Pfeiffer RM, Gail MH: Sample size calculations for population- and family-based case-control association studies on marker genotypes. Genet Epidemiol. 2003, 25: 136-148. 10.1002/gepi.10245.

Zondervan KT, Cardon LR: The complex interplay among factors that influence allelic association. Nat Rev Genet. 2004, 5: 89-100. 10.1038/nrg1270.

Zeggini E, Rayner W, Morris AP, Hattersley AT, Walker M, Hitman GA, et al: An evaluation of HapMap sample size and tagging SNP performance in large-scale empirical and simulated data sets. Nat Genet. 2005, 37: 1320-1322. 10.1038/ng1670.

Wigginton JE, Cutler DJ, Abecasis GR: A note on exact tests of Hardy-Weinberg equilibrium. Am J Hum Genet. 2005, 76: 887-893. 10.1086/429864.

Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, et al: The structure of haplotype blocks in the human genome. Science. 2002, 296: 2225-2229. 10.1126/science.1069424.

Barrett JC, Fry B, Maller J, Daly MJ: Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005, 21: 263-265. 10.1093/bioinformatics/bth457.

Qin ZS, Niu T, Liu JS: Partition-ligation-expectation-maximization algorithm for haplotype inference with single-nucleotide polymorphisms. Am J Hum Genet. 2002, 71: 1242-1247. 10.1086/344207.

Carlson CS, Eberle MA, Rieder MJ, Yi Q, Kruglyak L, Nickerson DA: Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am J Hum Genet. 2004, 74: 106-120. 10.1086/381000.