SequenceCEROSENE: a computational method and web server to visualize spatial residue neighborhoods at the sequence level
Tóm tắt
To understand the molecular function of biopolymers, studying their structural characteristics is of central importance. Graphics programs are often utilized to conceive these properties, but with the increasing number of available structures in databases or structure models produced by automated modeling frameworks this process requires assistance from tools that allow automated structure visualization. In this paper a web server and its underlying method for generating graphical sequence representations of molecular structures is presented. The method, called SequenceCEROSENE (color encoding of residues obtained by spatial neighborhood embedding), retrieves the sequence of each amino acid or nucleotide chain in a given structure and produces a color coding for each residue based on three-dimensional structure information. From this, color-highlighted sequences are obtained, where residue coloring represent three-dimensional residue locations in the structure. This color encoding thus provides a one-dimensional representation, from which spatial interactions, proximity and relations between residues or entire chains can be deduced quickly and solely from color similarity. Furthermore, additional heteroatoms and chemical compounds bound to the structure, like ligands or coenzymes, are processed and reported as well. To provide free access to SequenceCEROSENE, a web server has been implemented that allows generating color codings for structures deposited in the Protein Data Bank or structure models uploaded by the user. Besides retrieving visualizations in popular graphic formats, underlying raw data can be downloaded as well. In addition, the server provides user interactivity with generated visualizations and the three-dimensional structure in question. Color encoded sequences generated by SequenceCEROSENE can aid to quickly perceive the general characteristics of a structure of interest (or entire sets of complexes), thus supporting the researcher in the initial phase of structure-based studies. In this respect, the web server can be a valuable tool, as users are allowed to process multiple structures, quickly switch between results, and interact with generated visualizations in an intuitive manner. The SequenceCEROSENE web server is available at
https://biosciences.hs-mittweida.de/seqcerosene
.
Tài liệu tham khảo
Hutchinson EG, Thornton JM. HERA–a program to draw schematic diagrams of protein secondary structures. Proteins. 1990; 8(3):203–12. doi:http://dx.doi.org/10.1002/prot.340080303.
Bond CS. TopDraw: a sketchpad for protein structure topology cartoons. Bioinformatics. 2003; 19(2):311–2.
Stivala A, Wybrow M, Wirth A, Whisstock JC, Stuckey PJ. Automatic generation of protein structure cartoons with Pro-origami. Bioinformatics. 2011; 27(23):3315–6. doi:http://dx.doi.org/10.1093/bioinformatics/btr575.
Westhead DR, Slidel TW, Flores TP, Thornton JM. Protein structural topology: Automated analysis and diagrammatic representation. Protein Sci. 1999; 8(4):897–904. doi:http://dx.doi.org/10.1110/ps.8.4.897.
de Beer TAP, Berka K, Thornton JM, Laskowski RA. PDBsum additions. Nucleic Acids Res. 2014; 42(Database issue):292–6. doi:http://dx.doi.org/10.1093/nar/gkt940.
Omasits U, Ahrens CH, Müiller S, Wollscheid B. Protter: interactive protein feature visualization and integration with experimental proteomic data. Bioinformatics. 2014; 30(6):884–6. doi:http://dx.doi.org/10.1093/bioinformatics/btt607.
Martin AJM, Vidotto M, Boscariol F, Di Domenico T, Walsh I, Tosatto SCE. RING: networking interacting residues, evolutionary information and energetics in protein structures. Bioinformatics. 2011; 27(14):2003–5. doi:http://dx.doi.org/10.1093/bioinformatics/btr191.
Dhifli W, Diallo B. PGR: A Novel Graph Repository of Protein 3D-Structures. J Data Mining Genomics Proteomics. 2015; 6(2):1–4.
Vehlow C, Stehr H, Winkelmann M, Duarte JM, Petzold L, Dinse J, et al.CMView: interactive contact map visualization and analysis. Bioinformatics. 2011; 27(11):1573–4. doi:http://dx.doi.org/10.1093/bioinformatics/btr163.
Sobolev V, Eyal E, Gerzon S, Potapov V, Babor M, Prilusky J, et al.SPACE: a suite of tools for protein structure prediction and analysis based on complementarity and environment. Nucleic Acids Res. 2005; 33(Web Server issue):39–43. doi:http://dx.doi.org/10.1093/nar/gki398.
Berman HM, Kleywegt GJ, Nakamura H, Markley JL. The Protein Data Bank archive as an open data resource. J Comput Aided Mol Des. 2014; 28(10):1009–14. doi:http://dx.doi.org/10.1007/s10822-014-9770-y.
Schultz SC, Shields GC, Steitz TA. Crystal structure of a CAP-DNA complex: the DNA is bent by 90 degrees. Science. 1991; 253(5023):1001–7.
The PyMOL Molecular Graphics System. CA: DeLano Scientific. 2002. Available online at https://www.pymol.org/. Accessed 22 Sept 2015.
Humphrey W, Dalke A, Schulten K. VMD: visual molecular dynamics. J Mol Graph. 1996; 14(1):8233–78.
Biasini M. PV - WebGL-based protein viewer. 2014. doi:http://dx.doi.org/10.5281/zenodo.12620.