Characterizing short read sequencing for gene discovery and RNA-Seq analysis in Crassostrea gigas

Mackenzie R. Gavery1, Steven B. Roberts1
1School of Aquatic and Fishery Sciences, University of Washington, 1122 NE Boat Street, Seattle, WA 98105, USA

Tài liệu tham khảo

Altschul, 1990, Basic local alignment search tool, J. Mol. Biol., 215, 403, 10.1016/S0022-2836(05)80360-2 Baggerly, 2003, Differential expression in SAGE: accounting for normal between-library variation, Bioinformatics, 19, 1477, 10.1093/bioinformatics/btg173 Blankenberg, 2010, Galaxy: a web-based genome analysis tool for experimentalists, Curr. Protoc. Mol. Biol., 19, 10.1002/0471142727.mb1910s89 Craft, 2010, Pyrosequencing of Mytilus galloprovincialis cDNAs: tissue-specific expression patterns, Plos One, 5, e8875, 10.1371/journal.pone.0008875 Cullum, 2011, The next generation: using new sequencing technologies to analyse gene regulation, Respirology, 16, 210, 10.1111/j.1440-1843.2010.01899.x de Lorgeril, 2011, Whole transcriptome profiling of successful immune response to Vibrio infections in the oyster Crassostrea gigas by digital gene expression analysis, PLoS One, 6, e23142, 10.1371/journal.pone.0023142 Dohm, 2008, Substantial biases in ultra-short read data sets from high-throughput DNA sequencing, Nucleic Acids Res., 36, e105, 10.1093/nar/gkn425 Etebari, 2011, Deep sequencing-based transcriptome analysis of Plutella xylostella larvae parasitized by Diadegma semiclausum, BMC Genomics, 12, 446, 10.1186/1471-2164-12-446 Everett, 2011, Short reads and non-model species: exploring the complexities of next generation sequence assembly and SNP discovery in the absence of a reference genome, Mol. Ecol. Resour., 11, 93, 10.1111/j.1755-0998.2010.02969.x Ewing, 1998, Base-calling of automated sequencer traces using phred. II. Error probabilities, Genome Res., 8, 186, 10.1101/gr.8.3.186 Ewing, 1998, Base-calling of automated sequencer traces using phred. I. Accuracy assessment, Genome Res., 8, 175, 10.1101/gr.8.3.175 Feldmeyer, 2011, Short read Illumina data for the de novo assembly of a non-model snail species transcriptome (Radix balthica, Basommatophora, Pulmonata), and a comparison of assembler performance, BMC Genomics, 12, 317, 10.1186/1471-2164-12-317 Fleury, 2009, Generation and analysis of a 29,745 unique Expressed Sequence Tags from the Pacific oyster (Crassostrea gigas) assembled into a publicly accessible database: the GigasDatabase, BMC Genomics, 10, 341, 10.1186/1471-2164-10-341 Fraser, 2011, Sequencing and characterization of the guppy (Poecilia reticulata) transcriptome, BMC Genomics, 12, 202, 10.1186/1471-2164-12-202 Goecks, 2010, The Galaxy Team. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences, Genome Biol., 1, R86, 10.1186/gb-2010-11-8-r86 Goetz, 2010, A genetic basis for the phenotypic differentiation between siscowet and lean lake trout (Salvelinus namaycush), Mol. Ecol., 19, 176, 10.1111/j.1365-294X.2009.04481.x Gueguen, 2003, Immune gene discovery by expressed sequence tags generated from hemocytes of the bacteria-challenged oyster, Crassostrea gigas, Gene, 303, 139, 10.1016/S0378-1119(02)01149-6 Ha, 2009, Coordination of multiple dual oxidase–regulatory pathways in responses to commensal and infectious microbes in Drosophila gut, Nat. Immunol., 10, 949, 10.1038/ni.1765 Harrison, 1976, Studies on the chlorinating activity of myeloperoxidase, J. Biol. Chem., 251, 1371, 10.1016/S0021-9258(17)33749-3 Hoffman, 1991, Cloning of a factor required for activity of the Ah (dioxin) receptor, Science, 252, 954, 10.1126/science.1852076 Huang, 2009, Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists, Nucleic Acids Res., 37, 1, 10.1093/nar/gkn923 Huang, 2009, Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources, Nat. Protoc., 4, 44, 10.1038/nprot.2008.211 Johansson, 1999, Cell adhesion molecules in invertebrate immunity, Dev. Comp. Immunol., 23, 303, 10.1016/S0145-305X(99)00013-0 Kawahara-Miki, 2011, Expression profiling without genome sequence information in a non-model species, pandalid shrimp (Pandalus latirostris), by next-generation sequencing, PLoS One, 6, e26043, 10.1371/journal.pone.0026043 Koutsogiannaki, 2011, Effect of 17β-estradiol on adhesion of Mytilus galloprovincialis hemocytes to selected substrates. Role of alpha2 integrin subunit, Fish Shellfish Immunol., 31, 73, 10.1016/j.fsi.2011.04.003 Lamprou, 2007, Distinct signalling pathways promote phagocytosis of bacteria, latex beads and lipopolysaccharide in medfly haemocytes, Immunology, 121, 314, 10.1111/j.1365-2567.2007.02576.x Lin, 2010, White shrimp Litopenaeus vannamei that had received the hot-water extract of Spirulina platensis showed earlier recovery in immunity and up-regulation of gene expression after pH stress, Fish Shellfish Immunol., 29, 1092, 10.1016/j.fsi.2010.09.002 Meyer, 2009, Sequencing and de novo analysis of a coral larval transcriptome using 454 GS-Flx, BMC Genomics, 10, 10.1186/1471-2164-10-219 Meyer, 2011, Profiling gene expression responses of coral larvae (Acropora millepora) to elevated temperature and settlement inducers using a novel RNA-Seq procedure, Mol. Ecol., 20, 3599 Mortazavi, 2008, Mapping and quantifying mammalian transcriptomes by RNA-Seq, Nat. Methods, 5, 585, 10.1038/nmeth.1226 Newton, 2002 Novaes, 2008, High-throughput gene and SNP discovery in Eucalyptus grandis, an uncharacterized genome, BMC Genomics, 9, 312, 10.1186/1471-2164-9-312 Rice, 2009, Mutations involved in aicardi-goutieres syndrome implicate samhd1 as regulator of the innate immune response, Nat. Genet., 41, 829, 10.1038/ng.373 Roberts, 2008, Analysis of genes isolated from plated hemocytes of the Pacific oysters Crassostrea gigas, Mar. Biotechnol., 11, 24, 10.1007/s10126-008-9117-6 Schlenk, 1991, Studies on myeloperoxidase activity in the common mussel, Mytilus edulis, Comp. Biochem. Physiol., 99C, 63 Seeb, 2010, Transcriptome sequencing and high-resolution melt analysis advance SNP discovery in duplicated salmonids, Mol. Ecol. Resour., 11, 335, 10.1111/j.1755-0998.2010.02936.x Shendure, 2008, Next-generation DNA sequencing, Nat. Biotechnol., 26, 1135, 10.1038/nbt1486 Vera, 2008, Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing, Mol. Ecol., 17, 1636, 10.1111/j.1365-294X.2008.03666.x Wang, 2009, RNA-Seq: a revolutionary tool for transcriptomics, Nat. Rev. Genet., 10, 57, 10.1038/nrg2484 Washington State Department of Health: Office of Shellfish and Protection, 2006