FIMO: scanning for occurrences of a given motif

Bioinformatics (Oxford, England) - Tập 27 Số 7 - Trang 1017-1018 - 2011

Charles E. Grant¹, Timothy L. Bailey¹, William Stafford Noble¹

¹1 Department of Genome Sciences, University of Washington, Seattle, WA, USA, 2Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia and 3Department of Computer Science and Engineering, University of Washington, Seattle, WA, USA

Tóm tắt

AbstractSummary: A motif is a short DNA or protein sequence that contributes to the biological function of the sequence in which it resides. Over the past several decades, many computational methods have been described for identifying, characterizing and searching with sequence motifs. Critical to nearly any motif-based sequence analysis pipeline is the ability to scan a sequence database for occurrences of a given motif described by a position-specific frequency matrix.Results: We describe Find Individual Motif Occurrences (FIMO), a software tool for scanning DNA or protein sequences with motifs described as position-specific scoring matrices. The program computes a log-likelihood ratio score for each position in a given sequence database, uses established dynamic programming methods to convert this score to a P-value and then applies false discovery rate analysis to estimate a q-value for each position in the given sequence. FIMO provides output in a variety of formats, including HTML, XML and several Santa Cruz Genome Browser formats. The program is efficient, allowing for the scanning of DNA sequences at a rate of 3.5 Mb/s on a single CPU.Availability and Implementation: FIMO is part of the MEME Suite software toolkit. A web server and source code are available at http://meme.sdsc.edu.Contact: [email protected]; [email protected]Supplementary information: Supplementary data are available at Bioinformatics online.

Từ khóa

Tài liệu tham khảo

Bailey, 2003, Searching for statistically significant regulatory modules, Bioinformatics, 19, ii16, 10.1093/bioinformatics/btg1054

Bailey, 1998, Combining evidence using p-values: Application to sequence homology searches, Bioinformatics, 14, 48, 10.1093/bioinformatics/14.1.48

Bailey, 2009, MEME suite: tools for motif discovery and searching, Nucleic Acids Res., 37, W202, 10.1093/nar/gkp335

Haverty, 2004, CisML: an XML-based format for sequence motif detection software, Bioinformatics, 20, 1815, 10.1093/bioinformatics/bth162

Phillips, 2009, CTCF: master weaver of the genome, Cell, 137, 1194, 10.1016/j.cell.2009.06.001

Staden, 1994, Searching for motifs in nucleic acid sequences, Methods Mol. Biol., 25, 93

Storey, 2002, A direct approach to false discovery rates, J. R. Stat. Soci., 64, 479, 10.1111/1467-9868.00346

Storey, 2003, The positive false discovery rate: a bayesian interpretation and the q-value, Ann. Stat., 31, 2013, 10.1214/aos/1074290335

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Về chúng tôi

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích các bài báo, công bố khoa học Việt Nam. Công cụ trợ giúp người nghiên cứu, tạp chí, đơn vị nghiên cứu tra cứu, phân tích và thống kê dữ liệu nghiên cứu khoa học tại Việt Nam và quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia vào Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Công cụ kiểm tra chính tả và thể thức Viver

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA