The 3,000 rice genomes project
Abstract
Rice, Oryza sativa L., is the staple food for half the world’s population. By 2030, rice production must increase by at least 25% to keep up with global population growth and demand. Accelerated genetic gains in rice improvement are needed to mitigate the effects of climate change and loss of arable land, and to ensure a stable global food supply. Here, we report an international effort to resequence 3,000 rice genomes. We resequenced a core collection of 3,000 rice accessions from 89 countries. Across the 3,000 genomes, the average sequencing depth was 14×, with an average genome coverage of 94.0% and an average mapping rate of 92.5%. Aligning the reads to the reference genome of the temperate japonica variety Nipponbare revealed approximately 18.9 million single nucleotide polymorphisms (SNPs). Phylogenetic analyses based on the SNP data confirmed differentiation of the O. sativa gene pool into 5 varietal groups: indica, aus/boro, basmati/sadri, tropical japonica and temperate japonica. These data serve as a foundation for large-scale discovery of novel alleles for important rice phenotypes using various bioinformatics and/or genetic approaches. They also allow the genomic diversity within O. sativa to be examined at a higher level of detail. With the release of the sequencing data, the project calls on the global rice community to use these data as a foundation for establishing a global, public rice genetic/genomic database and information platform to advance rice breeding technology for future rice improvement.