SNP markers retrieval for a non-model species: a practical approach

Arwa Shahin1, Thomas van Gurp2, Sander Peters3, R.G.F. Visser1, J.M. van Tuyl1, Paul Arens1
1Wageningen University and Research Centre, Plant Breeding, P.O. Box 16, 6700, AJ Wageningen, The Netherlands
2Netherlands Institute of Ecology (NIOO-KNAW), Department of Terrestrial Ecology, P.O. Box 50, 6700 AB Wageningen, The Netherlands
3Wageningen University and Research Centre, Bioscience, P.O. Box 16, 6700, AJ, Wageningen, The Netherlands

Tóm tắt

Từ khóa


Tài liệu tham khảo

Paszkiewicz K, Studholme D: De novo assembly of short sequence reads. Brief Bioinform. 2010, 11 (5): 457-472. 10.1093/bib/bbq020.

Papanicolaou A, Stierli R, ffrench-Constant R, Heckel D: Next generation transcriptomes for next generation genomes using est2assembly. BMC Bioinformatics. 2009, 10 (1): 447-10.1186/1471-2105-10-447.

Kumar S, Blaxter M: Comparing de novo assemblers for 454 transcriptome data. BMC Genomics. 2010, 11 (1): 571-10.1186/1471-2164-11-571.

Palmieri N, Schlötterer C: Mapping accuracy of short reads from massively parallel sequencing and the implications for quantitative expression profiling. PLoS One. 2009, 4 (7): e6323-10.1371/journal.pone.0006323.

Zhang W, Chen J, Yang Y, Tang Y, Shang J, Shen B: A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies. PLoS One. 2011, 6 (3): e17915-10.1371/journal.pone.0017915.

Emrich SJ, Aluru S, Fu Y, Wen T-J, Narayanan M, Guo L, Ashlock DA, Schnable PS: A strategy for assembling the maize (Zea mays L.) genome. Bioinformatics. 2004, 20 (2): 140-147. 10.1093/bioinformatics/bth017.

Tang J, Vosman B, Voorrips R, van der Linden CG, Leunissen J: QualitySNP: a pipeline for detecting single nucleotide polymorphisms and insertions/deletions in EST data from diploid and polyploid species. BMC Bioinformatics. 2006, 7 (1): 438-10.1186/1471-2105-7-438.

Anithakumari AM, Tang J, van Eck HJ, Visser RG, Leunissen JA, Vosman B, van der Linden CG: A pipeline for high throughput detection and mapping of SNPs from EST databases. Mol Breeding: New Strategies In Plant Improvement. 2010, 26 (1): 65-75.

Vera Ruiz EM, Soriano JM, Romero C, Zhebentyayeva T, Terol J, Zuriaga E, Llácer G, Abbott AG, Badenes ML: Narrowing down the apricot Plum pox virus resistance locus and comparative analysis with the peach genome syntenic region. Mol Plant Pathol. 2011, 12 (6): 535-547. 10.1111/j.1364-3703.2010.00691.x.

Rivarola M, Foster JT, Chan AP, Williams AL, Rice DW, Liu X, Melake-Berhan A, Creasy HH, Puiu D, Rosovitz MJ, et al: Castor Bean Organelle genome sequencing and worldwide genetic diversity analysis. PLoS ONE. 2011, 6 (7): e21743-10.1371/journal.pone.0021743. doi:21710.21371/journal.pone.0021743

Gulyani V, Khurana P: Identification and expression profiling of drought-regulated genes in mulberry (Morus sp.) by suppression subtractive hybridization of susceptible and tolerant cultivars. Tree Genet Genomes. 2011, 7 (4): 725-738. 10.1007/s11295-011-0369-3.

Dubey A, Farmer A, Schlueter J, Cannon SB, Abernathy B, Tuteja R, Woodward J, Shah T, Mulasmanovic B, Kudapa H, et al: Defining the transcriptome assembly and its use for genome dynamics and transcriptome profiling studies in Pigeonpea (Cajanus cajan L.). DNA Research. 2011, 18 (3): 153-164. 10.1093/dnares/dsr007.

Franssen SU, Shrestha RP, Bräutigam A, Bornberg-Bauer E, Weber APM: Comprehensive transcriptome analysis of the highly complex Pisum sativum genome using next generation sequencing. BMC Genomics. 2011, 12 (1): 227-10.1186/1471-2164-12-227.

Sakai H, Ikawa H, Tanaka T, Numa H, Minami H, Fujisawa M, Shibata M, Kurita K, Kikuta A, Hamada M, et al: Distinct evolutionary patterns of Oryza glaberrima deciphered by genome sequencing and comparative analysis. Plant J. 2011, 66 (5): 796-805. 10.1111/j.1365-313X.2011.04539.x.

Tillett RL, Ergül A, Albion RL, Schlauch KA, Cramer GR, Cushman JC: Identification of tissue-specific, abiotic stress-responsive gene expression patterns in wine grape (Vitis vinifera L.) based on curation and mining of large-scale EST data sets. BMC Plant Biol. 2011, 11 (1): 86-10.1186/1471-2229-11-86.

Singhal D, Gupta P, Sharma P, Kashyap N, Anand S, Sharma H: In-silico single nucleotide polymorphisms (SNP) mining of Sorghum bicolor genome. Afr J Biotechnol. 2011, 10 (4): 580-583.

Bräutigam A, Mullick T, Schliesky S, Weber APM: Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C3 and C4 species. J Exp Bot. 2011, 62 (9): 3093-3102. 10.1093/jxb/err029.

Gomez-Alvarez V, Teal TK, Schmidt TM: Systematic artifacts in metagenomes from complex microbial communities. ISME J. 2009, 3 (11): 1314-1317. 10.1038/ismej.2009.72.

Kozarewa I, Ning Z, Quail MA, Sanders MJ, Berriman M, Turner DJ: Amplification-free Illumina sequencing-library preparation facilitates improved mapping and assembly of (G+C)-biased genomes. Nat Meth. 2009, 6 (4): 291-295. 10.1038/nmeth.1311.

Parchman T, Geist K, Grahnen J, Benkman C, Buerkle CA: Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics. 2010, 11 (1): 180-10.1186/1471-2164-11-180.

Wall PK, Leebens-Mack J, Chanderbali A, Barakat A, Wolcott E, Liang H, Landherr L, Tomsho L, Hu Y, Carlson J, et al: Comparison of next generation sequencing technologies for transcriptome characterization. BMC Genomics. 2009, 10 (1): 347-10.1186/1471-2164-10-347.

Tang J, Leunissen J, Voorrips R, van der Linden CG, Vosman B: HaploSNPer: a web-based allele and SNP detection tool. BMC Genet. 2008, 9 (1): 23-

Zerbino D, Birney E: Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008, 18 (5): 821-829. 10.1101/gr.074492.107.

Oases. [ http://www.ebi.ac.uk/~zerbino/oases/ ]

ABySS. [ http://www.bcgsc.ca/platform/bioinfo/software/abyss ]

SOAPdenovo. [ http://soap.genomics.org.cn/soapdenovo.html ]

Sterck L, Rombauts S, Vandepoele K, Rouzé P, Van de Peer Y: How many genes are there in plants (... and why are they there)?. Curr Opin Plant Biol. 2007, 10 (2): 199-203. 10.1016/j.pbi.2007.01.004.

Feldmeyer B, Wheat C, Krezdorn N, Rotter B, Pfenninger M: Short read Illumina data for the de novo assembly of a non-model snail species transcriptome (Radix balthica, Basommatophora, Pulmonata), and a comparison of assembler performance. BMC Genomics. 2011, 12 (1): 317-10.1186/1471-2164-12-317.

Lin H, Ouyang S, Egan A, Nobuta K, Haas B, Zhu W, Gu X, Silva J, Meyers B, Buell CR: Characterization of paralogous protein families in rice. BMC Plant Biol. 2008, 8 (1): 18-10.1186/1471-2229-8-18.

Li W, Godzik A: Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006, 22 (13): 1658-1659. 10.1093/bioinformatics/btl158.

Huang X, Madan A: CAP3: a DNA sequence assembly program. Genome Res. 1999, 9 (9): 868-877. 10.1101/gr.9.9.868.