Genomic Selection in Multi-environment Crop Trials
Tóm tắt
Genomic selection in crop breeding introduces modeling challenges not found in animal studies. These include the need to accommodate replicate plants for each line, consider spatial variation in field trials, address line by environment interactions, and capture nonadditive effects. Here, we propose a flexible single-stage genomic selection approach that resolves these issues. Our linear mixed model incorporates spatial variation through environment-specific terms, and also randomization-based design terms. It considers marker, and marker by environment interactions using ridge regression best linear unbiased prediction to extend genomic selection to multiple environments. Since the approach uses the raw data from line replicates, the line genetic variation is partitioned into marker and nonmarker residual genetic variation (i.e., additive and nonadditive effects). This results in a more precise estimate of marker genetic effects. Using barley height data from trials, in 2 different years, of up to 477 cultivars, we demonstrate that our new genomic selection model improves predictions compared to current models. Analyzing single trials revealed improvements in predictive ability of up to 5.7%. For the multiple environment trial (MET) model, combining both year trials improved predictive ability up to 11.4% compared to a single environment analysis. Benefits were significant even when fewer markers were used. Compared to a single-year standard model run with 3490 markers, our partitioned MET model achieved the same predictive ability using between 500 and 1000 markers depending on the trial. Our approach can be used to increase accuracy and confidence in the selection of the best lines for breeding and/or, to reduce costs by using fewer markers.
Từ khóa
Tài liệu tham khảo
Akaike, 1974, New look at statistical-model identification., Transactions on Automatic Control., AC19, 716, 10.1109/TAC.1974.1100705
Burgueno, 2012, Genomic prediction of breeding values when modeling genotype × environment interaction using pedigree and dense molecular markers., Crop Sci., 52, 707, 10.2135/cropsci2011.06.0299
Butler, 2009, ASReml R-reference manual
Comadran, 2012, Natural variation in a homolog of Antirrhinum CENTRORADIALIS contributed to spring growth habit and environmental adaptation in cultivated barley., Nat. Genet., 44, 1388, 10.1038/ng.2447
Crossa, 2010, Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers., Genetics, 186, 713, 10.1534/genetics.110.118521
Crossa, 2011, Genomic selection and prediction in plant breeding., J. Crop Improv., 25, 239, 10.1080/15427528.2011.558767
Cullis, 1998, Spatial analysis of multi-environment early generation trials., Biometrics, 54, 1, 10.2307/2533991
Cullis, 2006, On the design of early generation variety trials with correlated data., J. Agric. Biol. Environ. Stat., 11, 381, 10.1198/108571106X154443
Da, 2014, Mixed model methods for genomic prediction and variance component estimation of additive and dominance effects using SNP markers., PLoS One, 9, e87666, 10.1371/journal.pone.0087666
de Boer, 1993, Genetic evaluation methods for populations with dominance and inbreeding., Theor. Appl. Genet., 86, 245, 10.1007/BF00222086
de los Campos, 2013, Whole-genome regression and prediction methods applied to plant and animal breeding., Genetics, 193, 327, 10.1534/genetics.112.143313
Desta, 2014, Genomic selection: genome-wide prediction in plant improvement., Trends Plant Sci., 19, 592, 10.1016/j.tplants.2014.05.006
Gilmour, 1997, Accounting for natural and extraneous variation in the analysis of field experiments., J. Agric. Biol. Environ. Stat., 2, 269, 10.2307/1400446
Guo, 2013, Accuracy of across-environment genome wide prediction in maize nested association mapping populations., G3 (Bethesda), 3, 263, 10.1534/g3.112.005066
Habier, 2007, The impact of genetic relationship information on genome assisted breeding values., Genetics, 177, 2389, 10.1534/genetics.107.081190
Habier, 2013, Genomic BLUP decoded: a look into the black box of genomic prediction., Genetics, 194, 597, 10.1534/genetics.113.152207
Heffner, 2010, Plant breeding with genomic selection: gain per unit time and cost., Crop Sci., 50, 10.2135/cropsci2009.11.0662
Heffner, 2011, Genomic selection accuracy for grain quality traits in biparental wheat populations., Crop Sci., 51, 2597, 10.2135/cropsci2011.05.0253
Heffner, 2011, Genomic selection accuracy using multifamily prediction models in a wheat breeding program., Plant Genome, 4, 65, 10.3835/plantgenome.2010.12.0029
Heslot, 2012, Genomic selection in plant breeding: a comparison of models., Crop Sci., 52, 146, 10.2135/cropsci2011.06.0297
Jannink, 2010, Genomic selection in plant breeding: from theory to practice., Brief. Funct. Genomics, 9, 166, 10.1093/bfgp/elq001
Jonas, 2013, Does genomic selection have a future in plant breeding?, Trends Biotechnol., 31, 497, 10.1016/j.tibtech.2013.06.003
Kelly, 2007, The accuracy of varietal selection using factor analytic models for multi-environment plant breeding trials., Crop Sci., 47, 1063, 10.2135/cropsci2006.08.0540
Lado, 2013, Increased genomic prediction accuracy in wheat breeding through spatial adjustment of field trial data., G3 (Bethesda), 2, 2015
Ly, 2013, Relatedness and genotype × environment interaction affect prediction accuracies in genomic selection: a study in cassava., Crop Sci., 53, 1312, 10.2135/cropsci2012.11.0653
Massman, 2013, Genomewide selection verses marker-assisted recurrent selection to improve grain yield and stover-quality traits for cellulosic ethanol in maize., Crop Sci., 53, 58, 10.2135/cropsci2012.02.0112
Meuwissen, 2001, Prediction of total genetic value using genome-wide dense marker maps., Genetics, 157, 1819, 10.1093/genetics/157.4.1819
Munoz, 2014, Unravelling additive from nonadditive effects using genomic relationship matrices., Genetics, 198, 1759, 10.1534/genetics.114.171322
Oakey, 2013, Identification of crop cultivars with consistently high lignocellulosic sugar release requires the use of appropriate statistical design and modelling., Biotechnol. Biofuels, 6, 185, 10.1186/1754-6834-6-185
Patterson, 1977, Variability of yields of cereal varieties in U. K. trials., J. Agric. Sci., 89, 238, 10.1017/S002185960002743X
Piepho, 2012, Efficient computation of ridge-regression best linear unbiased prediction in genomic selection in plant breeding., Crop Sci., 52, 1093, 10.2135/cropsci2011.11.0592
R Core Team, 2015 R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Available at: http://www.R-project.org/.
Smith, 2001, Analyzing variety by environment data using multiplicative mixed models and adjustments for spatial field trend., Biometrics, 57, 1138, 10.1111/j.0006-341X.2001.01138.x
Smith, 2005, The analysis of crop cultivar breeding and evaluations trials: an overview of current mixed model approaches., J. Agric. Sci., 143, 1, 10.1017/S0021859605005587
Solberg, 2009, Persistence of accuracy of genome-wide breeding values over generations when including a polygenic effect., Genet. Sel. Evol., 41, 53, 10.1186/1297-9686-41-53
Stranden, 2009, Technical note: derivation of equivalent computing algorithms for genomic predictions and reliabilities of animal merit., J. Dairy Sci., 92, 2971, 10.3168/jds.2008-1929
Tibshirani, 1996, Regression shrinkage and selection via the LASSO., J. R. Stat. Soc., B, 58, 267, 10.1111/j.2517-6161.1996.tb02080.x
Wang, 2012, Comparison of five methods for genomic breeding value estimation for the common dataser of the 15th QTL-MAS workshop., BMC Proc., 6, S13, 10.1186/1753-6561-6-S2-S13
Wimmer, 2013, Genome-wide prediction of traits with different genetic architecture through efficient variable selection., Genetics, 195, 573, 10.1534/genetics.113.150078
Zhong, 2009, Factors affecting accuracy from genomic selection in populations derived from multiple inbred lines: a Barley case study., Genetics, 182, 355, 10.1534/genetics.108.098277