A novel hybrid gene prediction method employing protein multiple sequence alignments
Tóm tắt
Từ khóa
Tài liệu tham khảo
Attwood, 1994, Prints–a protein motif fingerprint database, Protein Eng., 7, 841, 10.1093/protein/7.7.841
Attwood, 2003, Prints and its automatic supplement, preprints, Nucleic Acids Res., 31, 400, 10.1093/nar/gkg030
Castellana, 2008, Discovery and revision of arabidopsis genes by proteogenomics, Proc. Natl Acad. Sci. USA, 105, 21034, 10.1073/pnas.0811066106
Harrow, 2009, Identifying protein-coding genes in genomic sequences, Genome Biol., 10, 201, 10.1186/gb-2009-10-1-201
Henikoff, 1991, Automated assembly of protein blocks for database searching, Nucleic Acids Res., 19, 6565, 10.1093/nar/19.23.6565
Henikoff, 1990, Finding protein similarities with nucleotide sequence databases, Methods Enzymol., 183, 111, 10.1016/0076-6879(90)83009-X
Henikoff, 1999, Blocks+: a non-redundant database of protein alignment blocks derived from multiple compilations, Bioinformatics, 15, 471, 10.1093/bioinformatics/15.6.471
Hunter, 2009, Interpro: the integrative protein signature database, Nucleic Acids Res., 37, D211, 10.1093/nar/gkn785
Keller, 2008, Scipio: using protein sequences to determine the precise exon/intron structures of genes and their orthologs in closely related species, BMC Bioinformatics, 9, 278, 10.1186/1471-2105-9-278
Kent, 2002, Blat–the blast-like alignment tool, Genome Res., 12, 656
Metzker, 2010, Sequencing technologies - the next generation, Nat. Rev. Genet., 11, 31, 10.1038/nrg2626
Meyer, 2004, Gene structure conservation aids similarity based gene prediction, Nucleic Acids Res., 32, 776, 10.1093/nar/gkh211
Odronitz, 2006, Pfarao: a web application for protein family analysis customized for cytoskeletal and motor proteins (cymobase), BMC Genomics, 7, 300, 10.1186/1471-2164-7-300
Odronitz, 2008, Webscipio: An online tool for the determination of gene structures using protein sequences, BMC Genomics, 9, 422, 10.1186/1471-2164-9-422
Pietrokovski, 1996, The blocks database–a system for protein classification, Nucleic Acids Res., 24, 197, 10.1093/nar/24.1.197
Quevillon, 2005, Interproscan: protein domains identifier, Nucleic Acids Res., 33, W116, 10.1093/nar/gki442
Slater, 2005, Automated generation of heuristics for biological sequence comparison, BMC Bioinformatics, 6, 31, 10.1186/1471-2105-6-31
Stanke, 2003, Gene prediction with a hidden Markov model and a new intron submodel, Bioinformatics, 19, 215, 10.1093/bioinformatics/btg1080
Stanke, 2006, Augustus at egasp: using est, protein and genomic alignments for improved gene prediction in the human genome, Genome Biol., 7, 1
Stanke, 2006, Gene prediction in eukaryotes with a generalized hidden markov model that uses hints from external sources, BMC Bioinformatics, 7, 62, 10.1186/1471-2105-7-62