Accelerated Profile HMM Searches

PLoS Computational Biology - Volume 7, Issue 10 - Page e1002195
Sean R. Eddy¹
¹HHMI Janelia Farm Research Campus, Ashburn, Virginia, United States of America.

References

SF Altschul, 1990, Basic local alignment search tool., J Mol Biol, 215, 403, 10.1016/S0022-2836(05)80360-2

SF Altschul, 1997, Gapped BLAST and PSI-BLAST: A new generation of protein database search programs., Nucleic Acids Res, 25, 3389, 10.1093/nar/25.17.3389

C Camacho, 2009, BLAST+: Architecture and applications., BMC Bioinformatics, 10, 421, 10.1186/1471-2105-10-421

A Krogh, 1994, Hidden Markov models in computational biology: Applications to protein modeling., J Mol Biol, 235, 1501, 10.1006/jmbi.1994.1104

R Durbin, 1998, Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids, 10.1017/CBO9780511790492

SF Altschul, 2001, The estimation of statistical parameters for local alignment score distributions., Nucleic Acids Res, 29, 351, 10.1093/nar/29.2.351

AA Schäffer, 2001, Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements., Nucleic Acids Res, 29, 2994, 10.1093/nar/29.14.2994

SF Altschul, 2005, Protein database searches using compositionally adjusted substitution matrices., FEBS J, 272, 5101, 10.1111/j.1742-4658.2005.04945.x

YK Yu, 2006, Retrieval accuracy, statistical significance and compositional similarity in protein sequence database searches., Nucleic Acids Res, 34, 5966, 10.1093/nar/gkl731

S Hunter, 2009, InterPro: the integrative protein signature database., Nucleic Acids Res, 37, D211, 10.1093/nar/gkn785

RD Finn, 2010, The Pfam protein families database., Nucleic Acids Res, 38, D211, 10.1093/nar/gkp985

V Chaudhary, 2006, Parallel implementations of local sequence alignment: Hardware and software., 233

JP Walters, 2006, Accelerating the HMMER sequence analysis suite using conventional processors., 10.1109/AINA.2006.68

J Landman, 2006, Accelerating HMMer searches on Opteron processors with minimally invasive recoding., 10.1109/AINA.2006.67

V Sachdeva, 2007, Exploring the viability of the Cell Broadband Engine for bioinformatics applications., Parallel Comput, 34, 616, 10.1016/j.parco.2008.04.001

RP Maddimsetty, 2006, Acceleration of Profile-HMM Search for Protein Sequences in Reconfigurable Hardware [Master's thesis]

S Derrien, 2007, Parallelizing HMMER for hardware acceleration on FPGAs., 10.1109/ASAP.2007.4429951

A Jacob, 2007, Preliminary results in accelerating profile HMM search on FPGAs., 10.1109/IPDPS.2007.370447

T Oliver, 2007, High performance database searching with HMMer on FPGAs., 10.1109/IPDPS.2007.370448

DR Horn, 2005, ClawHMMER: A streaming HMMer-search implementation., 10.1109/SC.2005.18

JP Walters, 2009, Evaluating the use of GPUs in liver image segmentation and HMMER database searches., 10.1109/IPDPS.2009.5161073

G Chukkapalli, 2004, SledgeHMMER: A web server for batch searching the Pfam database., Nucleic Acids Res, 32, W542, 10.1093/nar/gkh395

B Rekapalli, 2009, HSP-HMMER: A tool for protein identification on a large scale., 10.1145/1529282.1529443

Y Sun, 2007, Designing patterns for profile HMM search., Bioinformatics, 23, e36, 10.1093/bioinformatics/btl323

Y Sun, 2009, Designing patterns and profiles for faster HMM search., IEEE/ACM Trans Comput Biol Bioinform, 6, 232, 10.1109/TCBB.2008.14

LS Johnson, 2010, Hidden Markov model speed heuristic and iterative HMM search procedure., BMC Bioinformatics, 11, 431, 10.1186/1471-2105-11-431

TF Smith, 1981, Identification of common molecular subsequences., J Mol Biol, 147, 195, 10.1016/0022-2836(81)90087-5

M Madera, 2002, A comparison of profile hidden Markov model procedures for remote homology detection., Nucleic Acids Res, 30, 4321, 10.1093/nar/gkf544

S Johnson, 2006, Remote Protein Homology Detection Using Hidden Markov Models [Ph.D. thesis]

EK Freyhult, 2007, Exploring genomic dark matter: A critical assessment of the performance of homology search methods on noncoding RNA., Genome Res, 17, 117, 10.1101/gr.5890907

SR Eddy, 2008, A probabilistic model of local sequence alignment that simplifies statistical significance estimation., PLoS Comput Biol, 4, e1000069, 10.1371/journal.pcbi.1000069

T Rognes, 2001, ParAlign: A parallel sequence alignment algorithm for rapid and sensitive database searches., Nucleic Acids Res, 29, 1647, 10.1093/nar/29.7.1647

M Farrar, 2007, Striped Smith-Waterman speeds database searches six times over other SIMD implementations., Bioinformatics, 23, 156, 10.1093/bioinformatics/btl582

A Wozniak, 1997, Using video-oriented instructions to speed up sequence comparison., Comput Applic Biosci, 13, 145

T Rognes, 2000, Six-fold speed-up of Smith-Waterman sequence database searches using parallel processing on common microprocessors., Bioinformatics, 16, 699, 10.1093/bioinformatics/16.8.699

A Milosavljević, 1993, Discovering simple DNA sequences by the algorithmic significance method., Comput Applic Biosci, 9, 407

K Karplus, 1998, Hidden Markov models for detecting remote protein homologies., Bioinformatics, 14, 846, 10.1093/bioinformatics/14.10.846

LR Rabiner, 1989, A tutorial on hidden Markov models and selected applications in speech recognition., Proc IEEE, 77, 257, 10.1109/5.18626

SJ Melnikoff, 2003, Implementing the log-add algorithm in hardware., Electronics Letters, 12, 939, 10.1049/el:20030594

WR Pearson, 2000, Flexible sequence similarity searching with the FASTA3 program package., Meth Mol Biol, 132, 185

WN Grundy, 1998, Homology detection via family pairwise search., J Comput Biol, 5, 479, 10.1089/cmb.1998.5.479

EM Gertz, 2006, Composition-based statistics and translated nucleotide searches: Improving the TBLASTN module of BLAST., BMC Biol, 4, 41, 10.1186/1741-7007-4-41

The UniProt Consortium, 2011, Ongoing and future developments at the universal protein resource., Nucleic Acids Res, 39, D214, 10.1093/nar/gkq1020

GA Price, 2005, Statistical evaluation of pairwise protein sequence comparison with the Bayesian bootstrap., Bioinformatics, 21, 3824, 10.1093/bioinformatics/bti627