Surface antigens and potential virulence factors from parasites detected by comparative genomics of perfect amino acid repeats

Springer Science and Business Media LLC - Tập 5 - Trang 1-9 - 2007
Niklaus Fankhauser1, Tien-Minh Nguyen-Ha1, Joël Adler2, Pascal Mäser1
1Institute of Cell Biology, University of Bern, Bern, Switzerland
2Pädagogische Hochschule Bern, Bern, Switzerland

Tóm tắt

Many parasitic organisms, eukaryotes as well as bacteria, possess surface antigens with amino acid repeats. Making up the interface between host and pathogen such repetitive proteins may be virulence factors involved in immune evasion or cytoadherence. They find immunological applications in serodiagnostics and vaccine development. Here we use proteins which contain perfect repeats as a basis for comparative genomics between parasitic and free-living organisms. We have developed Reptile http://reptile.unibe.ch , a program for proteome-wide probabilistic description of perfect repeats in proteins. Parasite proteomes exhibited a large variance regarding the proportion of repeat-containing proteins. Interestingly, there was a good correlation between the percentage of highly repetitive proteins and mean protein length in parasite proteomes, but not at all in the proteomes of free-living eukaryotes. Reptile combined with programs for the prediction of transmembrane domains and GPI-anchoring resulted in an effective tool for in silico identification of potential surface antigens and virulence factors from parasites. Systemic surveys for perfect amino acid repeats allowed basic comparisons between free-living and parasitic organisms that were directly applicable to predict proteins of serological and parasitological importance. An on-line tool is available at http://genomics.unibe.ch/dora .

Tài liệu tham khảo

Marcotte EM, Pellegrini M, Yeates TO, Eisenberg D: A census of protein repeats. J Mol Biol 1999, 293: 151–160. 10.1006/jmbi.1999.3136 Andrade MA, Ponting CP, Gibson TJ, Bork P: Homology-based method for identification of protein repeats using statistical significance estimates. J Mol Biol 2000, 298: 521–537. 10.1006/jmbi.2000.3684 Andrade MA, Perez-Iratxeta C, Ponting CP: Protein repeats: structures, functions, and evolution. J Struct Biol 2001, 134: 117–131. 10.1006/jsbi.2001.4392 Heger A, Holm L: Rapid automatic detection and alignment of repeats in protein sequences. Proteins 2000, 41: 224–237. 10.1002/1097-0134(20001101)41:2<224::AID-PROT70>3.0.CO;2-Z Radar [http://www.ebi.ac.uk/Radar] Repro [http://ibivu.cs.vu.nl/programs/reprowww] George RA, Heringa J: The REPRO server: finding protein internal sequence repeats through the Web. Trends Biochem Sci 2000, 25: 515–517. 10.1016/S0968-0004(00)01643-1 Pellegrini M, Marcotte EM, Yeates TO: A fast algorithm for genome-wide analysis of proteins with repeated sequences. Proteins 1999, 35: 440–446. 10.1002/(SICI)1097-0134(19990601)35:4<440::AID-PROT7>3.0.CO;2-Y Internal Repeats Finder [http://nihserver.mbi.ucla.edu/Repeats] Katti MV, Sami-Subbu R, Ranjekar PK, Gupta VS: Amino acid repeat patterns in protein sequences: their diversity and structural-functional implications. Protein Sci 2000, 9: 1203–1209. TRIPS [http://www.ncl-india.org/trips] Szklarczyk R, Heringa J: Tracking repeats using significance and transitivity. Bioinformatics 2004, 20 Suppl 1: I311-I317. 10.1093/bioinformatics/bth911 Trust [http://zeus.cs.vu.nl/programs/trustwww/] Murray KB, Taylor WR, Thornton JM: Toward the detection and validation of repeats in protein structure. Proteins 2004, 57: 365–380. 10.1002/prot.20202 Depledge DP, Lower RP, Smith DF: RepSeq--a database of amino acid repeats present in lower eukaryotic pathogens. BMC Bioinformatics 2007, 8: 122. 10.1186/1471-2105-8-122 RepSeq [http://www.repseq.gugbe.com] REP [http://www.embl-heidelberg.de/~andrade/papers/rep/search.html] Gruber M, Soding J, Lupas AN: REPPER--repeats and their periodicities in fibrous proteins. Nucleic Acids Res 2005, 33: W239–43. 10.1093/nar/gki405 Repper [http://toolkit.tuebingen.mpg.de/repper] Kalita MK, Ramasamy G, Duraisamy S, Chauhan VS, Gupta D: ProtRepeatsDB: a database of amino acid repeats in genomes. BMC Bioinformatics 2006, 7: 336. 10.1186/1471-2105-7-336 ProtRepeatsDB [http://bioinfo.icgeb.res.in/repeats] Leid RW, Suquet CM, Tanigoshi L: Parasite defense mechanisms for evasion of host attack; a review. Vet Parasitol 1987, 25: 147–162. 10.1016/0304-4017(87)90101-4 Kedzierski L, Montgomery J, Curtis J, Handman E: Leucine-rich repeats in host-pathogen interactions. Arch Immunol Ther Exp (Warsz) 2004, 52: 104–112. Roditi I, Carrington M, Turner M: Expression of a polypeptide containing a dipeptide repeat is confined to the insect stage of Trypanosoma brucei. Nature 1987, 325: 272–274. 10.1038/325272a0 Vassella E, Acosta-Serrano A, Studer E, Lee SH, Englund PT, Roditi I: Multiple procyclin isoforms are expressed differentially during the development of insect forms of Trypanosoma brucei. J Mol Biol 2001, 312: 597–607. 10.1006/jmbi.2001.5004 Enea V, Ellis J, Zavala F, Arnot DE, Asavanich A, Masuda A, Quakyi I, Nussenzweig RS: DNA cloning of Plasmodium falciparum circumsporozoite gene: amino acid sequence of repetitive epitope. Science 1984, 225: 628–630. 10.1126/science.6204384 Peacock SJ, Moore CE, Justice A, Kantzanou M, Story L, Mackie K, O'Neill G, Day NP: Virulent combinations of adhesin and toxin genes in natural populations of Staphylococcus aureus. Infect Immun 2002, 70: 4987–4996. 10.1128/IAI.70.9.4987-4996.2002 Beadle C, Long GW, Weiss WR, McElroy PD, Maret SM, Oloo AJ, Hoffman SL: Diagnosis of malaria by detection of Plasmodium falciparum HRP-2 antigen with a rapid dipstick antigen-capture assay. Lancet 1994, 343: 564–568. 10.1016/S0140-6736(94)91520-2 Snounou G, Renia L: The vaccine is dead--long live the vaccine. Trends Parasitol 2007, 23: 129–132. 10.1016/j.pt.2007.02.001 Ansari FA, Kumar N, Bala Subramanyam M, Gnanamani M, Ramachandran S: MAAP: Malarial adhesins and adhesin-like proteins predictor. Proteins 2007. Samen U, Eikmanns BJ, Reinscheid DJ, Borges F: The surface protein Srr-1 of Streptococcus agalactiae binds human keratin 4 and promotes adherence to epithelial HEp-2 cells. Infect Immun 2007. Brinster S, Posteraro B, Bierne H, Alberti A, Makhzami S, Sanguinetti M, Serror P: Enterococcal leucine-rich repeat-containing protein involved in virulence and host inflammatory response. Infect Immun 2007, 75: 4463–4471. 10.1128/IAI.00279-07 Tomley FM, Billington KJ, Bumstead JM, Clark JD, Monaghan P: EtMIC4: a microneme protein from Eimeria tenella that contains tandem arrays of epidermal growth factor-like repeats and thrombospondin type-I repeats. Int J Parasitol 2001, 31: 1303–1310. 10.1016/S0020-7519(01)00255-7 de la Fuente J, Garcia-Garcia JC, Barbet AF, Blouin EF, Kocan KM: Adhesion of outer membrane proteins containing tandem repeats of Anaplasma and Ehrlichia species (Rickettsiales: Anaplasmataceae) to tick cells. Vet Microbiol 2004, 98: 313–322. 10.1016/j.vetmic.2003.11.001 Cherny I, Rockah L, Levy-Nissenbaum O, Gophna U, Ron EZ, Gazit E: The formation of Escherichia coli curli amyloid fibrils is mediated by prion-like peptide repeats. J Mol Biol 2005, 352: 245–252. Inclusion-exclusion principle [http://en.wikipedia.org/wiki/Inclusion-exclusion_principle] Reptile [http://reptile.unibe.ch] Katinka MD, Duprat S, Cornillot E, Metenier G, Thomarat F, Prensier G, Barbe V, Peyretaillade E, Brottier P, Wincker P, Delbac F, El Alaoui H, Peyret P, Saurin W, Gouy M, Weissenbach J, Vivares CP: Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi. Nature 2001, 414: 450–453. 10.1038/35106579 Petersen C, Nelson R, Leech J, Jensen J, Wollish W, Scherf A: The gene product of the Plasmodium falciparum 11.1 locus is a protein larger than one megadalton. Mol Biochem Parasitol 1990, 42: 189–195. 10.1016/0166-6851(90)90161-E Ilg T: Proteophosphoglycans of Leishmania. Parasitol Today 2000, 16: 489–497. 10.1016/S0169-4758(00)01791-9 Campuzano J, Aguilar D, Arriaga K, Leon JC, Salas-Rangel LP, Gonzalez-y-Merchand J, Hernandez-Pando R, Espitia C: The PGRS domain of Mycobacterium tuberculosis PE_PGRS Rv1759c antigen is an efficient subunit vaccine to prevent reactivation in a murine model of chronic tuberculosis. Vaccine 2007, 25: 3722–3729. 10.1016/j.vaccine.2006.12.042 Kall L, Krogh A, Sonnhammer EL: A combined transmembrane topology and signal peptide prediction method. J Mol Biol 2004, 338: 1027–1036. 10.1016/j.jmb.2004.03.016 Fankhauser N, Maser P: Identification of GPI anchor attachment signals by a Kohonen self-organizing map. Bioinformatics 2005, 21: 1846–1852. 10.1093/bioinformatics/bti299 Dora [http://genomics.unibe.ch/dora] Usdin M, Guillerm M, Chirac P: Neglected tests for neglected patients. Nature 2006, 441: 283–284. 10.1038/441283a FIND diagnostics [http://www.finddiagnostics.org] Pruess M, Kersey P, Apweiler R: The Integr8 project--a resource for genomic and proteomic data. In Silico Biol 2005, 5: 179–185. [ftp://ftp.ebi.ac.uk/pub/databases/integr8/] Mann Whitney test [http://en.wikipedia.org/wiki/Mann-Whitney_U] Wilcoxon test [http://en.wikipedia.org/wiki/Wilcoxon_signed-rank_test] Spearman correlation [http://en.wikipedia.org/wiki/Spearman_correlation] Eddy SR: Multiple alignment using hidden Markov models. Proc Int Conf Intell Syst Mol Biol 1995, 3: 114–120.