Mauve: Multiple Alignment of Conserved Genomic Sequence With Rearrangements

Genome Research - Tập 14 Số 7 - Trang 1394-1403 - 2004
Aaron E. Darling1, Bob Mau2,3, Frederick R. Blattner4,5, Nicole T. Perna2,5
1Department of Computer Science, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA.
2Department of Animal Health and Biomedical Sciences,
3Department of Oncology
4Department of Genetics, and
5Genome Center of Wisconsin, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA

Tóm tắt

As genomes evolve, they undergo large-scale evolutionary processes that present a challenge to sequence comparison not posed by short sequences. Recombination causes frequent genome rearrangements, horizontal transfer introduces new sequences into bacterial chromosomes, and deletions remove segments of the genome. Consequently, each genome is a mosaic of unique lineage-specific segments, regions shared with a subset of other genomes and segments conserved among all the genomes under consideration. Furthermore, the linear order of these segments may be shuffled among genomes. We present methods for identification and alignment of conserved genomic DNA in the presence of rearrangements and horizontal transfer. Our methods have been implemented in a software package called Mauve. Mauve has been applied to align nine enterobacterial genomes and to determine global rearrangement structure in three mammalian genomes. We have evaluated the quality of Mauve alignments and drawn comparison to other methods through extensive simulations of genome evolution.

Từ khóa


Tài liệu tham khảo

10.1089/106652701753216503

10.1101/gr.10.7.950

1997, Genome Inform Ser. Workshop Genome Inform., 8, 25

10.1126/science.277.5331.1453

2002, Genome Res., 12, 26

10.1093/nar/gkg623

10.1101/gr.789803

10.1101/gr.926603

2003, Bioinformatics, 19 Suppl 1, I54

2003, Bioinformatics, 19 Suppl 1, i74

10.1093/bioinformatics/btg378

10.1093/nar/27.11.2369

10.1128/JB.185.7.2330-2337.2003

10.1126/science.1086132

10.1093/embo-reports/kve097

10.1093/dnares/8.1.11

2002, Bioinformatics, 18 Suppl 1, S312

10.1093/nar/gkf566

10.1101/gr.10.8.1115

2000, Proc. Int. Conf. Intell. Syst. Mol. Biol., 8, 228

10.1111/1467-9868.00356

10.1093/bioinformatics/18.3.452

10.1093/bioinformatics/btf843

2003, Bioinformatics, 19 Suppl 1, I190

Martins, W.S., del Cuvillo, J., Cui, W., and Gao, G.R. 2001. Whole genome alignment using a multithreaded parallel implementation. Symposium on Computer Architecture and High Performance Computing, pp. 1–8.

10.1038/35101614

10.1093/bioinformatics/15.3.211

10.1093/bioinformatics/16.10.948

10.1073/pnas.93.22.12098

10.1006/jmbi.2000.4042

10.1038/35101607

10.1038/35054089

10.1101/gr.757503

10.1073/pnas.1330369100

10.1093/bioinformatics/13.3.235

10.1038/nature02053

10.1099/ijs.0.01472-0

10.1093/nar/gkg579

10.1101/gr.809403

10.1093/nar/22.22.4673

10.1093/nar/27.13.2682

10.1038/79918

2003, Nat. Rev. Genet., 4, 251

10.1128/IAI.71.5.2775-2786.2003

10.1073/pnas.252529799

http://gel.ahabs.wisc.edu/mauve; the Mauve alignment system and visualization environment.