Transcription Factor Information System (TFIS): A Tool for Detection of Transcription Factor Binding Sites

Priyanka Narad1, Abhishek Kumar1, Amlan Chakraborty1, Pranav Patni1, Abhishek Sengupta1, Gulshan Wadhwa2, K. C. Upadhyaya3
1Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
2Department of Biotechnology, Ministry of Science & Technology, New Delhi, India
3Amity Institute of Molecular Biology and Genomics, Amity University Uttar Pradesh, Noida, India

Tóm tắt

Transcription factors are trans-acting proteins that interact with specific nucleotide sequences known as transcription factor binding site (TFBS), and these interactions are implicated in regulation of the gene expression. Regulation of transcriptional activation of a gene often involves multiple interactions of transcription factors with various sequence elements. Identification of these sequence elements is the first step in understanding the underlying molecular mechanism(s) that regulate the gene expression. For in silico identification of these sequence elements, we have developed an online computational tool named transcription factor information system (TFIS) for detecting TFBS for the first time using a collection of JAVA programs and is mainly based on TFBS detection using position weight matrix (PWM). The database used for obtaining position frequency matrices (PFM) is JASPAR and HOCOMOCO, which is an open-access database of transcription factor binding profiles. Pseudo-counts are used while converting PFM to PWM, and TFBS detection is carried out on the basis of percent score taken as threshold value. TFIS is equipped with advanced features such as direct sequence retrieving from NCBI database using gene identification number and accession number, detecting binding site for common TF in a batch of gene sequences, and TFBS detection after generating PWM from known raw binding sequences in addition to general detection methods. TFIS can detect the presence of potential TFBSs in both the directions at the same time. This feature increases its efficiency. And the results for this dual detection are presented in different colors specific to the orientation of the binding site. Results obtained by the TFIS are more detailed and specific to the detected TFs as integration of more informative links from various related web servers are added in the result pages like Gene Ontology, PAZAR database and Transcription Factor Encyclopedia in addition to NCBI and UniProt. Common TFs like SP1, AP1 and NF-KB of the Amyloid beta precursor gene is easily detected using TFIS along with multiple binding sites. In another scenario of embryonic developmental process, TFs of the FOX family (FOXL1 and FOXC1) were also identified. TFIS is platform-independent which is publicly available along with its support and documentation at http://tfistool.appspot.com and http://www.bioinfoplus.com/tfis/ . TFIS is licensed under the GNU General Public License, version 3 (GPL-3.0).

Tài liệu tham khảo

Wheelock CE, Wheelock ÅM, Kawashima S, Diez D, Kanehisa M, van Erk M, Goto S (2009) Systems biology approaches and pathway tools for investigating cardiovascular disease. Mol BioSyst 5(6):588–602 Ong CT, Corces VG (2011) Enhancer function: new insights into the regulation of tissue-specific gene expression. Nat Rev Genet 12(4):283–293 Tompa M, Li N, Bailey TL, Church GM, De Moor B, Eskin E, Makeev VJ (2005) Assessing computational tools for the discovery of transcription factor binding sites. Nat Biotechnol 23(1):137–144 van Helden J, André B, Collado-Vides J (1998) Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. J Mol Biol 281(5):827–842 Vilo J, Brazma A, Jonassen I, Robinson AJ, Ukkonen E (2000) Mining for putative regulatory elements in the yeast genome using gene expression data. In Ismb 2000:384–394 Akiyama Y (1995) TFSEARCH: searching transcription factor binding sites. Real World Computing Partnership, Japan Matys V, Kel-Margoulis OV, Fricke E, Liebich I, Land S, Barre-Dirrie A, Voss N (2006) TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes. Nucleic Acids Res 34(suppl 1):D108–D110 Messeguer X, Escudero R, Farré D, Núñez O, Martĺnez J, Albà MM (2002) PROMO: detection of known transcription regulatory elements using species-tailored searches. Bioinformatics 18(2):333–334 Lenhard B, Wasserman WW (2002) TFBS: computational framework for transcription factor binding site analysis. Bioinformatics 18(8):1135–1136 Tan G (2014) TFBSTools: software package for transcription factor binding site (TFBS) analysis. R package version 1(0) Mathelier A, Zhao X, Zhang AW, Parcy F, Worsley-Hunt R, Arenillas DJ, Lim J (2013) JASPAR 2014: an extensively expanded and updated open-access database of transcription factor binding profiles. Nucleic Acids Res gkt997 Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL (2004) GenBank: update. Nucleic Acids Res 32(Database issue):D23 Bucher P (1990) Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences. J Mol Biol 212:563–578 Wasserman WW, Sandelin A (2004) Applied bioinformatics for the identification of regulatory elements. Nat Rev Genet 5:276–287 Nishida K, Frith MC, Nakai K (2009) Pseudocounts for transcription factor binding sites. Nucleic Acids Res 37:939–944 Pan Y, Phan S (2008) Threshold for positional weight matrix. Eng Lett 16:498–504 Barrett T, Soboleva A et al (2013) NCBI GEO: archive for functional genomics data sets–update. Nucleic Acids Res 41:991–995 Guenther MG, Frampton GM, Soldner F, Hockemeyer D, Mitalipova M, Jaenisch R, Young RA (2010) Chromatin structure and gene expression programs of human embryonic and induced pluripotent stem cells. Cell Stem Cell 7:249–257 Schoumacher M, Shao W et al (2014) Inhibiting Tankyrases sensitizes KRAS-mutant cancer cells to MEK inhibitors via FGFR2 feedback signaling. Cancer Res 74:3294–3305 Cabeza-Arvelaiz Y, Schiestl RH (2012) Transcriptome analysis of a rotenone model of parkinsonism reveals complex I-tied and-untied toxicity mechanisms common to neurodegenerative diseases. PLoS One 7:44700 Kim WJ, Rivera MN, Coffman EJ, Haber DA (2012) The WTX tumor suppressor enhances p53 acetylation by CBP/p300. Mol Cell 45:587–597 Dunckley T, Stephan DA et al (2006) Gene expression correlates of neurofibrillary tangles in Alzheimer’s disease. Neurobiol Aging 27:1359–1371 Theuns J, Christine VB (2000) Transcriptional regulation of Alzheimer’s disease genes: implications for susceptibility. Hum Mol Genet 9:2383–2394 Qiu W, Hu Y, Andersen TE, Jafari A, Li N, Chen W, Kassem M (2010) Tumor necrosis factor receptor superfamily member 19 (TNFRSF19) regulates differentiation fate of human mesenchymal (stromal) stem cells through canonical Wnt signaling and C/EBP. J Biol Chem 285:14438–14449 Katoh M, Katoh M (2007) Conserved POU/OCT- and GATA-binding sites in 5′-flanking promoter region of mammalian WNT8B orthologs. Int J Oncol 30:1273–1277 Zehentner BK, Dony C, Burtscher H (1999) The transcription factor Sox9 is involved in BMP-2 signaling. J Bone Miner Res 14:1734–1741 Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Sherlock G (2000) Gene Ontology: tool for the unification of biology. Nat Genet 25(1):25–29 Portales-Casamar E, Kirov S, Lim J, Lithwick S, Swanson MI, Ticoll A, Wasserman WW (2007) PAZAR: a framework for collection and dissemination of cis-regulatory sequence annotation. Genome Biol 8(10):R207 Yusuf D, Butland SL, Swanson MI, Bolotin E, Ticoll A, Cheung WA, Prince KL (2012) The transcription factor encyclopedia. Genome Biol 13(3):R24