Is there an optimum level of diversity in utilization of genetic resources?
Tóm tắt
Capitalizing upon the genomic characteristics of long-term random mating populations, sampling from pre-selected landraces is a promising approach for broadening the genetic base of elite germplasm for quantitative traits.
Genome-enabled strategies for harnessing untapped allelic variation of landraces are currently evolving. The success of such approaches depends on the choice of source material. Thus, the analysis of different strategies for sampling allelic variation from landraces and their impact on population diversity and linkage disequilibrium (LD) is required to ensure the efficient utilization of diversity. We investigated the impact of different sampling strategies on diversity parameters and LD based on high-density genotypic data of 35 European maize landraces each represented by more than 20 individuals. On average, five landraces already captured ~95% of the molecular diversity of the entire dataset. Within landraces, absence of pronounced population structure, consistency of linkage phases and moderate to low LD levels were found. When combining data of up to 10 landraces, LD decay distances decreased to a few kilobases. Genotyping 24 individuals per landrace with 5k SNPs was sufficient for obtaining representative estimates of diversity and LD levels to allow an informed pre-selection of landraces. Integrating results from European with Central and South American landraces revealed that European landraces represent a unique and diverse spectrum of allelic variation. Sampling strategies for harnessing allelic variation from landraces depend on the study objectives. If the focus lies on the improvement of elite germplasm for quantitative traits, we recommend sampling from pre-selected landraces, as it yields a wide range of diversity, allows optimal marker imputation, control for population structure and avoids the confounding effects of strong adaptive alleles.
Tài liệu tham khảo
Alexander DH, Novembre J, Lange K (2009) Fast model-based estimation of ancestry in unrelated individuals. Genome Res 19:1655–1664. doi:10.1101/gr.094052.109
Böhm J, Schipprack W, Utz HF, Melchinger AE (2017) Tapping the genetic diversity of landraces in allogamous crops with doubled haploid lines: a case study from European flint maize. Theor Appl Genet 130:861–873. doi:10.1007/s00122-017-2856-x
Browning BL, Browning SR (2009) A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. Am J Hum Genet 84:210–223. doi:10.1016/j.ajhg.2009.01.005
Chia J-M, Song C, Bradbury PJ, Costich D, de Leon N, Doebley J, Elshire RJ, Gaut B, Geller L, Glaubitz JC, Gore M, Guill KE, Holland J, Hufford MB, Lai J, Li M, Liu X, Lu Y, McCombie R, Nelson R, Poland J, Prasanna BM, Pyhajarvi T, Rong T, Sekhon RS, Sun Q, Tenaillon MI, Tian F, Wang J, Xu X, Zhang Z, Kaeppler SM, Ross-Ibarra J, McMullen MD, Buckler ES, Zhang G, Xu Y, Ware D (2012) Maize HapMap2 identifies extant variation from a genome in flux. Nat Genet 44:803–807. doi:10.1038/ng.2313
Conrad DF, Jakobsson M, Coop G, Wen X, Wall JD, Rosenberg NA, Pritchard JK (2006) A worldwide survey of haplotype variation and linkage disequilibrium in the human genome. Nat Genet 38:1251–1260. doi:10.1038/ng1911
Core Team R (2013) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna
Dray S, Dufour AB (2007) The ade4 package: implementing the duality diagram for ecologists. J Stat Softw 22:1–20
Dubreuil P, Warburton M, Chastanet M, Hoisington D, Charcosset A (2006) More on the introduction of temperate maize into Europe: large-scale bulk SSR genotyping and new historical elements. Maydica 51:281–291
Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE (2011) A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One 6:e19379. doi:10.1371/journal.pone.0019379
Excoffier L, Smouse PE, Quattro JM (1992) Analysis of molecular variance inferred from metric distances among DNA haplotypes - application to human mitochondrial-DNA restriction data. Genetics 131:479–491
Ganal MW, Durstewitz G, Polley A, Berard A, Buckler ES, Charcosset A, Clarke JD, Graner EM, Hansen M, Joets J, Le Paslier MC, McMullen MD, Montalent P, Rose M, Schön C-C, Sun Q, Walter H, Martin OC, Falque M (2011) A large maize (Zea mays L.) SNP genotyping array: development and germplasm genotyping, and genetic mapping to compare with the B73 reference genome. PLoS One 6:e28334. doi:10.1371/journal.pone.0028334
Goodman MM (1999) Broadening the genetic diversity in maize breeding by use of exotic germplasm. In: Coors JG, Pandey S (eds) The genetics and exploitation of heterosis in crops. ASA, Madison, pp 139–148
Gorjanc G, Jenko J, Hearne SJ, Hickey JM (2016) Initiating maize pre-breeding programs using genomic selection to harness polygenic variation from landrace populations. BMC Genom 17:30. doi:10.1186/s12864-015-2345-z
Gower JC (1966) Some distance properties of latent root and vector methods used in multivariate analysis. Biometrika 53:325–338. doi:10.2307/2333639
Hearne S, Chen C, Buckler E, Mitchell S (2014) Unimputed GbS derived SNPs for maize landrace accessions represented in the SeeD-maize GWAS panel. International Maize and Wheat Improvement Center. http://hdl.handle.net/11529/10034. Accessed 24 March 2016
Hill WG, Robertson A (1968) Linkage disequilibrium in finite populations. Theor Appl Genet 38:226–231. doi:10.1007/BF01245622
Hill WG, Weir BS (1988) Variances and covariances of squared linkage disequilibria in finite populations. Theor Popul Biol 33:54–78. doi:10.1016/0040-5809(88)90004-4
Hufford MB, Xu X, van Heerwaarden J, Pyhajarvi T, Chia JM, Cartwright RA, Elshire RJ, Glaubitz JC, Guill KE, Kaeppler SM, Lai J, Morrell PL, Shannon LM, Song C, Springer NM, Swanson-Wagner RA, Tiffin P, Wang J, Zhang G, Doebley J, McMullen MD, Ware D, Buckler ES, Yang S, Ross-Ibarra J (2012) Comparative population genomics of maize domestication and improvement. Nat Genet 44:808–811. doi:10.1038/ng.2309
Krakowsky MD, Holley R, Deutsch J, Rice J, Blanco MH, Goodman M (2008) Maize allelic diversity project. 50th Maize Genetics Conference, Washington, DC
Lehermeier C, Schön C-C, de los Campos G (2015) Assessment of genetic heterogeneity in structured plant populations using multivariate whole-genome regression models. Genetics 201:323–337. doi:10.1534/genetics.115.177394
Lonnquist JH (1974) Consideration and experiences with recombinations of exotic and Corn Belt maize germplasm. In: Wilkinson D (ed) 29th Report of annual corn sorghum research conference. Am. Seed Trade Assoc, Chicago, pp 102–117
Mantel N (1967) The detection of disease clustering and a generalized regression approach. Cancer Res 27:209–220
Matsuoka Y, Vigouroux Y, Goodman MM, Sanchez GJ, Buckler E, Doebley J (2002) A single domestication for maize shown by multilocus microsatellite genotyping. PNAS 99:6080–6084. doi:10.1073/pnas.052125199
McCouch S, Baute GJ, Bradeen J, Bramel P, Bretting PK, Buckler E, Burke JM, Charest D, Cloutier S, Cole G, Dempewolf H, Dingkuhn M, Feuillet C, Gepts P, Grattapaglia D, Guarino L, Jackson S, Knapp S, Langridge P, Lawton-Rauh A, Lijua Q, Lusty C, Michael T, Myles S, Naito K, Nelson RL, Pontarollo R, Richards CM, Rieseberg L, Ross-Ibarra J, Rounsley S, Hamilton RS, Schurr U, Stein N, Tomooka N, van der Knaap E, van Tassel D, Toll J, Valls J, Varshney RK, Ward J, Waugh R, Wenzl P, Zamir D (2013) Agriculture: feeding the future. Nature 499:23–24. doi:10.1038/499023a
Melchinger AE, Schopp P, Müller D, Schrag TA, Bauer E, Unterseer S, Homann L, Schipprack W, Schön C-C (2017) Safeguarding our genetic resources with libraries of doubled-haploid lines. Genetics 206:1611–1619
Messmer MM, Melchinger AE, Boppenmaier J, Brunklaus-Jung E, Herrmann RG (1992) Relationships among early European maize inbreds: I. Genetic diversity among Flint and Dent lines revealed by RFLPs. Crop Sci 32:1301–1309
Nei M, Li W-H (1979) Mathematical model for studying genetic variation in terms of restriction endonucleases. PNAS 76:5269–5273
Nei M, Tajima F (1981) DNA polymorphism detectable by restriction endonucleases. Genetics 97:145–163
Nielsen R, Slatkin M (2013) An introduction to population genetics: theory and applications. Sinauer Associates, Sunderland
Oettler G, Schnell FW, Utz HF (1976) Die westdeutschen Getreide- und Kartoffelsortimente im Spiegel ihrer Vermehrungsflächen. Eugen Ulmer, Stuttgart
Paradis E, Claude J, Strimmer K (2004) APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20:289–290. doi:10.1093/bioinformatics/btg412
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81:559–575. doi:10.1086/519795
Rebourg C, Chastanet M, Gouesnard B, Welcker C, Dubreuil P, Charcosset A (2003) Maize introduction into Europe: the history reviewed in the light of molecular data. Theor Appl Genet 106:895–903. doi:10.1007/s00122-002-1140-9
Romay MC, Millard MJ, Glaubitz JC, Peiffer JA, Swarts KL, Casstevens TM, Elshire RJ, Acharya CB, Mitchell SE, Flint-Garcia SA, McMullen MD, Holland JB, Buckler ES, Gardner CA (2013) Comprehensive genotyping of the USA national maize inbred seed bank. Genome Biol 14:R55. doi:10.1186/gb-2013-14-6-r55
Romero Navarro JA, Willcox M, Burgueno J, Romay C, Swarts K, Trachsel S, Preciado E, Terron A, Delgado HV, Vidal V, Ortega A, Banda AE, Montiel NOG, Ortiz-Monasterio I, Vicente FS, Espinoza AG, Atlin G, Wenzl P, Hearne S, Buckler ES (2017) A study of allelic diversity underlying flowering-time adaptation in maize landraces. Nat Genet 49:476–480. doi:10.1038/ng.3784
Saghai-Maroof MA, Soliman KM, Jorgensen RA, Allard RW (1984) Ribosomal DNA spacer-length polymorphisms in barley: Mendelian inheritance, chromosomal location, and population dynamics. PNAS 81:8014–8018
Saitou N, Nei M (1987) The neighbor-joining method - a new method for reconstructing phylogenetic trees. Mol Biol Evol 4:406–425
Sood S, Flint-Garcia S, Willcox CM, Holland JB (2014) Mining natural variation for maize improvement: selection on phenotypes and genes. In: Tuberosa R, Graner A, Frison E (eds) Genomics of plant genetic resources, vol 1. Managing, sequencing and mining genetic resources. Springer, Netherlands, pp 615–649
Takuno S, Ralph P, Swarts K, Elshire RJ, Glaubitz JC, Buckler ES, Hufford MB, Ross-Ibarra J (2015) Independent molecular basis of convergent highland adaptation in maize. Genetics 200:1297–1312. doi:10.1534/genetics.115.178327
Tanksley SD, McCouch SR (1997) Seed banks and molecular maps: unlocking genetic potential from the wild. Science 277:1063–1066
Tarter JA, Holland JB (2006) Gains from selection during the development of semiexotic inbred lines from Latin American maize accessions. Maydica 51:15–23
Technow F, Riedelsheimer C, Schrag TA, Melchinger AE (2012) Genomic prediction of hybrid performance in maize with models incorporating dominance and population specific marker effects. Theor Appl Genet 125:1181–1194. doi:10.1007/s00122-012-1905-8
Unterseer S, Bauer E, Haberer G, Seidel M, Knaak C, Ouzunova M, Meitinger T, Strom TM, Fries R, Pausch H, Bertani C, Davassi A, Mayer KFX, Schön C-C (2014) A powerful tool for genome analysis in maize: development and evaluation of the high density 600k SNP genotyping array. BMC Genom 15:823. doi:10.1186/1471-2164-15-823
Unterseer S, Seidel MA, Bauer E, Haberer G, Hochholdinger F, Opitz N, Marcon C, Baruch K, Spannagl M, Mayer KFX, Schön C-C (2017) European Flint reference sequences complement the maize pan-genome. bioRxiv 103747
van Heerwaarden J, Doebley J, Briggs WH, Glaubitz JC, Goodman MM, Sanchez-Gonzalez JJ, Ross-Ibarra J (2011) Genetic signals of origin, spread, and introgression in a large sample of maize landraces. PNAS 108:1088–1092. doi:10.1073/pnas.1013011108
Vigouroux Y, Glaubitz JC, Matsuoka Y, Goodman MM, Sanchez GJ, Doebley J (2008) Population structure and genetic diversity of new world maize races assessed by DNA microsatellites. Am J Bot 95:1240–1253. doi:10.3732/ajb.0800097
Weir BS, Cockerham CC (1984) Estimating F-statistics for the analysis of population structure. Evolution 38:1358–1370. doi:10.2307/2408641
Wilcoxon F (1945) Individual comparisons by ranking methods. Biom Bull 1:80–83. doi:10.2307/3001968
Wilde K, Burger H, Prigge V, Presterl T, Schmidt W, Ouzunova M, Geiger HH (2010) Testcross performance of doubled-haploid lines developed from European flint maize landraces. Plant Breed 129:181–185. doi:10.1111/j.1439-0523.2009.01677.x
Wright S (1978) Evolution and the genetics of populations: variability within and among natural populations. The University of Chicago Press, Chicago
Zhao K, Aranzana MJ, Kim S, Lister C, Shindo C, Tang C, Toomajian C, Zheng H, Dean C, Marjoram P, Nordborg M (2007) An arabidopsis example of association mapping in structured samples. PLoS Genet 3:e4. doi:10.1371/journal.pgen.0030004