Biopython: freely available Python tools for computational molecular biology and bioinformatics

Bioinformatics - Tập 25 Số 11 - Trang 1422-1423 - 2009
Peter Cock1, Tiago Antão1, Jeffrey T. Chang1, Brad Chapman1, Cymon J. Cox1, Andrew Dalke1, Iddo Friedberg1, Thomas Hamelryck1, Frank Kauff1, Bartek Wilczyński1, Michiel de Hoon1,2
11 Plant Pathology, SCRI, Invergowrie, Dundee, DD2 5DA, 2Liverpool School of Tropical Medicine, Liverpool, L3 5QA, UK, 3Institute for Genome Sciences and Policy, Duke University Medical Center, Durham, NC, 4Department of Molecular Biology, Simches Research Center, Massachusetts General Hospital, Boston, MA 02114, USA, 5Centro de Ciências do Mar, Universidade do Algarve, Faro, Portugal, 6Andrew Dalke Scientific, AB, Gothenburg, Sweden, 7California Institute for Telecommunications and Information Technology, University of California, San Diego, 9500 Gilman Dr., La Jolla, CA 92093-0446, USA, 8Bioinformatics Center, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, 2200 Copenhagen N, Denmark, 9Molecular Phylogenetics, Department of Biology, TU Kaiserslautern, 67653 Kaiserslautern, UK, 10EMBL Heidelberg, Meyerhofstraβe 1, 69117 Heidelberg, Germany, 11Institute of Informatics, University of Warsaw, Poland and 12RIKEN Omics Science Center, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama-shi, Kanagawa-ken,
2RIKEN

Tóm tắt

Abstract Summary: The Biopython project is a mature open source international collaboration of volunteer developers, providing Python libraries for a wide range of bioinformatics problems. Biopython includes modules for reading and writing different sequence file formats and multiple sequence alignments, dealing with 3D macro molecular structures, interacting with common tools such as BLAST, ClustalW and EMBOSS, accessing key online databases, as well as providing numerical methods for statistical learning. Availability: Biopython is freely available, with documentation and source code at www.biopython.org under the Biopython license. Contact: All queries should be directed to the Biopython mailing lists, see www.biopython.org/wiki/[email protected].

Từ khóa


Tài liệu tham khảo

Chapman, 2000, Biopython: Python tools for computational biology, ACM SIGBIO Newslett., 20, 15, 10.1145/360262.360268

Chaudhuri, 2006, xBASE, a collection of online databases for bacterial comparative genomics, Nucleic Acids Res., 34, D335, 10.1093/nar/gkj140

Bateman, 2004, The Pfam protein families database, Nucleic Acids Res., 32, D138, 10.1093/nar/gkh121

Beaumont, 1996, Evaluating loci for use in the genetic analysis of population structure, Proc. R. Soc. Lond. B, 263, 1619, 10.1098/rspb.1996.0237

Benson, 2007, GenBank, Nucleic Acids Res., 35, D21, 10.1093/nar/gkl986

Felsenstein, 1989, PHYLIP -phylogeny inference package (Version 3.2), Cladistics, 5, 164

Hamelryck, 2003, PDB file parser and structure class implemented in Python, Bioinformatics, 19, 2308, 10.1093/bioinformatics/btg299

Hinsen, 2000, The molecular modeling toolkit: a new approach to molecular simulations, J. Comp. Chem., 21, 79, 10.1002/(SICI)1096-987X(20000130)21:2<79::AID-JCC1>3.0.CO;2-B

Holland, 2008, BioJava: an open-source framework for bioinformatics, Bioinformatics, 24, 2096, 10.1093/bioinformatics/btn397

De Hoon, 2004, Open source clustering software, Bioinformatics, 20, 1453, 10.1093/bioinformatics/bth078

Kauff, 2007, WASABI: an automated sequence processing system for multi-gene phylogenies, Syst. Biol., 56, 523, 10.1080/10635150701395340

Kulikova, 2006, EMBL nucleotide sequence database in 2006, Nucleic Acids Res., 35, D16, 10.1093/nar/gkl913

Lavel, 2004, SIMCOAL 2.0: a program to simulate genomic diversity over large recombining regions in a subdivided population with a complex history, Bioinformatics, 20, 2485, 10.1093/bioinformatics/bth264

Maddison, 1997, NEXUS: an extensible file format for systematic information, Syst. Biol., 46, 590, 10.1093/sysbio/46.4.590

Oliphant, 2006, Guide to NumPy.

Oliphant, 2007, Python for Scientific Computing, Comput. Sci. Eng., 9, 10, 10.1109/MCSE.2007.58

Pearson, 1988, Improved tools for biological sequence analysis, PNAS, 85, 2444, 10.1073/pnas.85.8.2444

Pritchard, 2006, GenomeDiagram: a Python package for the visualisation of large-scale genomic data, Bioinformatics, 22, 616, 10.1093/bioinformatics/btk021

Rice, 2000, EMBOSS: the European molecular biology open software suite, Trends Genet., 16, 276, 10.1016/S0168-9525(00)02024-2

Rousset, 2007, GENEPOP '007: a complete re-implementation of the GENEPOP software for Windows and Linux, Mol. Ecol. Res., 8, 103, 10.1111/j.1471-8286.2007.01931.x

Stajich, 2002, The Bioperl toolkit: Perl modules for the life sciences, Genome Res., 12, 1611, 10.1101/gr.361602

Thompson, 1994, CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice, Nucleic Acids Res., 22, 4673, 10.1093/nar/22.22.4673