BEDTools: a flexible suite of utilities for comparing genomic features

Bioinformatics - Tập 26 Số 6 - Trang 841-842 - 2010
Aaron R. Quinlan1, Ira M. Hall1
11 Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine and 2 Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22908, USA

Tóm tắt

Abstract

Motivation: Testing for correlations between different sets of genomic features is a fundamental task in genomics research. However, searching for overlaps between features with existing web-based methods is complicated by the massive datasets that are routinely produced with current sequencing technologies. Fast and flexible tools are therefore required to ask complex questions of these data in an efficient manner.

Results: This article introduces a new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format. BEDTools also supports the comparison of sequence alignments in BAM format to both BED and GFF features. The tools are extremely efficient and allow the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks. BEDTools can be combined with one another as well as with standard UNIX commands, thus facilitating routine genomics tasks as well as pipelines that can quickly answer intricate questions of large genomic datasets.

Availability and implementation: BEDTools was written in C++. Source code and a comprehensive user manual are freely available at http://code.google.com/p/bedtools

Contact:  [email protected]; [email protected]

Supplementary information:  Supplementary data are available at Bioinformatics online.

Từ khóa


Tài liệu tham khảo

Giardine, 2005, Galaxy: a platform for interactive large-scale genome analysis, Genome Res., 15, 1451, 10.1101/gr.4086505

Kent, 2002, The human genome browser at UCSC, Genome Res., 12, 996, 10.1101/gr.229102

Li, 2009, The Sequence Alignment/Map format and SAMtools, Bioinformatics, 25, 2078, 10.1093/bioinformatics/btp352

Pearson, 1988, Improved tools for biological sequence comparison, Proc. Natl Acad. Sci. USA, 85, 2444, 10.1073/pnas.85.8.2444

Smit, 1996, RepeatMasker. Open-3.0.