Mining functional subgraphs from cancer protein-protein interaction networks

BMC Systems Biology - Tập 6 - Trang 1-14 - 2012
Ru Shen1,2, Nalin CW Goonesekere3, Chittibabu Guda1,4
1Department of Genetics, Cell Biology, and Anatomy, University of Nebraska Medical Center, Omaha, USA
2Department of Computer Science, State University of New York at Albany, Albany, USA
3Department of Chemistry and Biochemistry, University of Northern Iowa, Cedar Falls, USA
4Bioinformatics and Systems Biology Core Facility, University of Nebraska Medical Center, Omaha, USA

Tóm tắt

Protein-protein interaction (PPI) networks carry vital information about proteins' functions. Analysis of PPI networks associated with specific disease systems including cancer helps us in the understanding of the complex biology of diseases. Specifically, identification of similar and frequently occurring patterns (network motifs) across PPI networks will provide useful clues to better understand the biology of the diseases. In this study, we developed a novel pattern-mining algorithm that detects cancer associated functional subgraphs occurring in multiple cancer PPI networks. We constructed nine cancer PPI networks using differentially expressed genes from the Oncomine dataset. From these networks we discovered frequent patterns that occur in all networks and at different size levels. Patterns are abstracted subgraphs with their nodes replaced by node cluster IDs. By using effective canonical labeling and adopting weighted adjacency matrices, we are able to perform graph isomorphism test in polynomial running time. We use a bottom-up pattern growth approach to search for patterns, which allows us to effectively reduce the search space as pattern sizes grow. Validation of the frequent common patterns using GO semantic similarity showed that the discovered subgraphs scored consistently higher than the randomly generated subgraphs at each size level. We further investigated the cancer relevance of a select set of subgraphs using literature-based evidences. Frequent common patterns exist in cancer PPI networks, which can be found through effective pattern mining algorithms. We believe that this work would allow us to identify functionally relevant and coherent subgraphs in cancer networks, which can be advanced to experimental validation to further our understanding of the complex biology of cancer.

Tài liệu tham khảo

Cook DJ, Holder LB: Mining graph data. 2007, John Wiley & Sons, Inc.

Borgwardt KM, Kriegel HP: Graph kernels for disease outcome prediction from protein-protein interaction networks. Pacific Symposium on Biocomputing. 2007, 4-15.

Li Z, Zhang S, Wang Y, Zhang XS, Chen L: Alignment of protein interaction networks by integer quadratic programming. Bioinformatics. 2007, 23: 1631-1639. 10.1093/bioinformatics/btm156.

Yan X, Han J: Graph-based substructure pattern mining. Proceedings of 2002 International Conference on Data Mining. 2002, 721-724.

Ketkar NS, Holder LB, Cook DJ: Subdue: Compression-based frequent pattern discovery in graph data. OSDM. 2005

King AD, Przulj N, Jurisica I: Protein complex prediction via cost-based clustering. Bioinformatics. 2004, 20: 3013-3020. 10.1093/bioinformatics/bth351.

Le TV, Kulikowski CA, Muchnik IB: Coring method for clustering a graph. Proceedings of ICPR. 2008

Guda C, King BR, Pal LR, Guda P: A Top-Down Approach to Infer and Compare Domain-Domain Interactions across Eight Model Organisms. PLoS ONE. 2009, 4: e5096-10.1371/journal.pone.0005096.

Xenarios I, Rice DW, Salwinski L: DIP: the Database of Interacting Proteins. Nucleic Acids Res. 2000, 28: 289-291. 10.1093/nar/28.1.289.

Hartke SG, Radcliffe AJ: Mckay's canonical graph labeling algorithm. Contemporary Mathematics book series. 2009, 479: 99-111.

Brown KR, Jurisica I: Online predicted human interaction database. Bioinformatics. 2005, 21: 2076-2082. 10.1093/bioinformatics/bti273.

Wang JZ, Du Z, Payattakool R, Yu PS, Chen CF: A new method to measure the semantic similarity of GO terms. Bioinformatics. 2007, 23: 1274-1281. 10.1093/bioinformatics/btm087.

Schlossmacher GA, Stevens A, White A: Glucocorticoid receptor mediated apoptosis: mechanisms of resistance in cancer cells. J Endocrinol. 2011, 211: 17-25. 10.1530/JOE-11-0135.

Xu J, Wu RC, O'Malley BW: Normal and cancer-related functions of the p160 steroid receptor co-activator (SRC) family. Nat Rev Cancer. 2009, 9: 615-630. 10.1038/nrc2695.

Iyer NG, Ozdag H, Caldas C: p300/CBP and cancer. Oncogene. 2004, 23: 4225-4231. 10.1038/sj.onc.1207118.

Mullighan CG, Zhang J, Kasper LH: CREBBP mutations in relapsed acute lymphoblastic leukaemia. Nature. 2011, 471: 235-239. 10.1038/nature09727.

Tang Y, Zhao W, Chen Y, Zhao Y, Gu W: Acetylation is indispensable for p53 activation. Cell. 2008, 133: 612-626. 10.1016/j.cell.2008.03.025.

Ferreon JC, Lee CW, Arai M, Martinez-Yamout MA, Dyson HJ, Wright PE: Cooperative regulation of p53 by modulation of ternary complex formation with CBP/p300 and HDM2. Proc Natl Acad Sci USA. 2009, 106: 6591-6596. 10.1073/pnas.0811023106.

Shi D, Pop MS, Kulikov R, Love IM, Kung AL, Grossman SR: CBP and p300 are cytoplasmic E4 polyubiquitin ligases for p53. Proc Natl Acad Sci USA. 2009, 106: 16275-16280. 10.1073/pnas.0904305106.

Salicioni AM, Xi M, Vanderveer LA, Balsara B, Testa JR, Dunbrack RL, Godwin AK: Identification and structural analysis of human RBM8A and RBM8B: two highly conserved RNA-binding motif proteins that interact with OVCA1, a candidate tumor suppressor. Genomics. 2000, 69: 54-62. 10.1006/geno.2000.6315.

Eilers M, Eisenman RN: Myc's broad reach. Genes Dev. 2008, 22: 2755-2766. 10.1101/gad.1712408.

Patel JH, Loboda AP, Showe MK, Showe LC, McMahon SB: Analysis of genomic targets reveals complex functions of MYC. Nat Rev Cancer. 2004, 4: 562-568. 10.1038/nrc1393.

Staller P, Peukert K, Kiermaier A, Seoane J, Lukas J, Karsunky H: Repression of p15INK4b expression by Myc through association with Miz-1. Nat Cell Biol. 2001, 3: 392-399. 10.1038/35070076.

Zeller KI, Zhao X, Lee CW, Chiu KP, Yao F, Yustein JT: Global mapping of c-Myc binding sites and target gene networks in human B cells. Proc Natl Acad Sci USA. 2006, 103: 17834-17839. 10.1073/pnas.0604129103.