The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools

Nucleic Acids Research - Volume 40, Issue D1 - Pages D1202-D1210 - 2012
Philippe Lamesch1, Tanya Berardini1, Donghui Li1, David Swarbreck1, Christopher Wilks1, Rajkumar Sasidharan1, Robert Müller1, Kate Dreher1, Debbie L. Alexander1, M. Garcia-Hernandez1, Athikkattuvalasu S. Karthikeyan1, Cynthia H. Lee1, William D. Nelson1, L. Ploetz1, Shanker K. Singh1, April Wensel1, Eva Huala1
1Department of Plant Biology, Carnegie Institution, 260 Panama St, Stanford, CA 94305, USA.

References

National Research Council, 2008, Funding a Revolution: Achievements of the National Plant Genome Initiative and New Horizons in Plant Biology

Xu, 2011, The value of Arabidopsis research in understanding human disease states, Curr. Opin. Biotechnol., 22, 300, 10.1016/j.copbio.2010.11.007

Koornneef, 2010, The development of Arabidopsis as a model plant, Plant J., 61, 909, 10.1111/j.1365-313X.2009.04086.x

Buell, 2010, Twenty-first century plant biology: impacts of the Arabidopsis genome on plant biology and agriculture, Plant Physiol., 154, 497, 10.1104/pp.110.159541

Avni, 2011, Can plant biotechnology help in solving our food and energy shortage in the future?, Curr. Opin. Biotechnol., 22, 220, 10.1016/j.copbio.2011.01.007

Chew, 2011, A stress-free walk from Arabidopsis to crops, Curr. Opin. Biotechnol., 22, 281, 10.1016/j.copbio.2010.11.011

Zhang, 2011, Arabidopsis as a model for wood formation, Curr. Opin. Biotechnol., 22, 293, 10.1016/j.copbio.2010.11.008

Hays, 2002, Arabidopsis thaliana, a versatile model system for study of eukaryotic genome-maintenance functions, DNA Repair, 1, 579, 10.1016/S1568-7864(02)00093-9

van Baarlen, 2007, Disease induction by human microbial pathogens in plant-model systems: potential, problems and prospects, Drug Discov. Today, 12, 167, 10.1016/j.drudis.2006.12.007

Jones, 2008, The impact of Arabidopsis on human health: diversifying our portfolio, Cell, 133, 939, 10.1016/j.cell.2008.05.040

Schlaich, 2011, Arabidopsis thaliana – the model plant to study host-pathogen interactions, Curr. Drug Targets, 12, 955, 10.2174/138945011795677863

Gene Ontology Consortium, 2010, The Gene Ontology in 2010: extensions and refinements, Nucleic Acids Res., 38, D331, 10.1093/nar/gkp1018

Jaiswal, 2005, Plant Ontology (PO): a controlled vocabulary of plant structures and growth stages, Comp. Funct. Genomics, 6, 388, 10.1002/cfg.496

Reference Genome Group of the Gene Ontology Consortium, 2009, The Gene Ontology's Reference Genome Project: a unified framework for functional annotation across species, PLoS Comput. Biol., 5, e1000431, 10.1371/journal.pcbi.1000431

Zdobnov, 2001, InterProScan – an integration platform for the signature-recognition methods in InterPro, Bioinformatics, 17, 847, 10.1093/bioinformatics/17.9.847

Emanuelsson, 2007, Locating proteins in the cell using TargetP, SignalP and related tools, Nat. Protoc., 2, 953, 10.1038/nprot.2007.131

Van Auken, 2009, Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curation, BMC Bioinformatics, 10, 228, 10.1186/1471-2105-10-228

Haas, 2005, Complete reannotation of the Arabidopsis genome: methods, tools, protocols and the final release, BMC Biol., 3, 7, 10.1186/1741-7007-3-7

Lewis, 2002, Apollo: a sequence annotation editor, Genome Biol., 3, research0082, 10.1186/gb-2002-3-12-research0082

Haas, 2003, Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies, Nucleic Acids Res., 31, 5654, 10.1093/nar/gkg770

Swarbreck, 2008, The Arabidopsis Information Resource (TAIR): gene structure and function annotation, Nucleic Acids Res., 36, D1009, 10.1093/nar/gkm965

Buisine, 2008, Improved detection and annotation of transposable elements in sequenced genomes using multiple reference sequence sets, Genomics, 91, 467, 10.1016/j.ygeno.2008.01.005

Hayden, 2007, Identification of novel conserved peptide uORF homology groups in Arabidopsis and rice reveals ancient eukaryotic origin of select groups and preferential association with transcription factor-encoding genes, BMC Biol., 5, 32, 10.1186/1741-7007-5-32

Hanada, 2007, A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and/or under purifying selection, Genome Res., 17, 632, 10.1101/gr.5836207

Alexandrov, 2006, Features of Arabidopsis genes and genome discovered using full-length cDNAs, Plant Mol. Biol., 60, 69, 10.1007/s11103-005-2564-9

Backman, 2008, Update of ASRP: the Arabidopsis Small RNA Project database, Nucleic Acids Res., 36, D982, 10.1093/nar/gkm997

Aubourg, 2007, Analysis of CATMA transcriptome data identifies hundreds of novel functional genes and improves gene models in the Arabidopsis genome, BMC Genomics, 8, 401, 10.1186/1471-2164-8-401

Lister, 2008, Highly integrated single-base resolution maps of the epigenome in Arabidopsis, Cell, 133, 523, 10.1016/j.cell.2008.03.029

Baerenfaller, 2008, Genome-scale proteomics reveals Arabidopsis thaliana gene models and proteome dynamics, Science, 320, 938, 10.1126/science.1157956

Castellana, 2008, Discovery and revision of Arabidopsis genes by proteogenomics, Proc. Natl Acad. Sci. USA, 105, 21034, 10.1073/pnas.0811066106

Zhang, 2006, PseudoPipe: an automated pseudogene identification pipeline, Bioinformatics, 22, 1437, 10.1093/bioinformatics/btl116

Schiex, 2001, Eugène, an eukaryotic gene finder that combines several sources of evidence, Lect. Notes Comp. Sci., 2066/2001, 111, 10.1007/3-540-45727-5_10

Thierry-Mieg, 2006, AceView: a comprehensive cDNA-supported gene and transcripts annotation, Genome Biol., 7, S12.1, 10.1186/gb-2006-7-s1-s12

Ossowski, 2008, Sequencing of natural strains of Arabidopsis thaliana with short reads, Genome Res., 18, 2024, 10.1101/gr.080200.108

Filichkin, 2010, Genome-wide mapping of alternative splicing in Arabidopsis thaliana, Genome Res., 20, 45, 10.1101/gr.093302.109

Trapnell, 2009, TopHat: discovering splice junctions with RNA-Seq, Bioinformatics, 25, 1105, 10.1093/bioinformatics/btp120

Bryant, 2010, Supersplat – spliced RNA-seq alignment, Bioinformatics, 26, 1500, 10.1093/bioinformatics/btq206

Stanke, 2006, AUGUSTUS: ab initio prediction of alternative transcripts, Nucleic Acids Res., 34, W435, 10.1093/nar/gkl200

Trapnell, 2010, Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation, Nat. Biotechnol., 28, 511, 10.1038/nbt.1621

Baerenfaller, 2011, pep2pro: a new tool for comprehensive proteome data analysis to reveal information about organ-specific proteomes in Arabidopsis thaliana, Integr. Biol., 3, 225, 10.1039/c0ib00078g

Müller, 2004, Textpresso: an ontology-based information retrieval and extraction system for biological literature, PLoS Biol., 2, e309, 10.1371/journal.pbio.0020309

Kao, 2008, Browsing multidimensional molecular networks with the generic network browser (N-Browse), Curr. Protoc. Bioinformatics, Chapter 9, 10.1002/0471250953.bi0911s23

Stark, 2011, The BioGRID Interaction Database: 2011 update, Nucleic Acids Res., 39, D698, 10.1093/nar/gkq1116

Aranda, 2010, The IntAct molecular interaction database in 2010, Nucleic Acids Res., 38, D525, 10.1093/nar/gkp878

McKay, 2010, Using the Generic Synteny Browser (GBrowse_syn), Curr. Protoc. Bioinformatics, Chapter 9, 10.1002/0471250953.bi0912s31

Nicol, 2009, The Integrated Genome Browser: free software for distribution and exploration of genome-scale datasets, Bioinformatics, 25, 2730, 10.1093/bioinformatics/btp472

Wu, 2005, GMAP: a genomic mapping and alignment program for mRNA and EST sequences, Bioinformatics, 21, 1859, 10.1093/bioinformatics/bti310

Li, 2007, A cross-species alignment tool (CAT), BMC Bioinformatics, 8, 349, 10.1186/1471-2105-8-349

Zhang, 2010, Creation of a genome-wide metabolic pathway database for Populus trichocarpa using a new approach for reconstruction and curation of metabolic pathways for plants, Plant Physiol., 153, 1479, 10.1104/pp.110.157396