Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification

Bioinformatics - Volume 21, Issue 5 - Pages 650-659 - 2005
Itai Yanai¹, Hila Benjamin¹, Michael Shmoish¹, Vered Chalifa‐Caspi², Maxim Shklar¹, Ron Ophir², Arren Bar‐Even¹, Shirley Horn‐Saban², Marilyn Safran², Eytan Domany³, Doron Lancet¹, Orit Shmueli¹
¹ Department of Molecular Genetics, Weizmann Institute of Science, 76100 Rehovot, Israel
² Department of Biological Services, Weizmann Institute of Science, 76100 Rehovot, Israel
³ Department of Physics of Complex Systems, Weizmann Institute of Science, 76100 Rehovot, Israel

Abstract

Motivation: Genes are often characterized dichotomously as either housekeeping or single-tissue specific. We conjectured that crucial functional information resides in genes with midrange profiles of expression.

Results: To obtain such novel information genome-wide, we determined the mRNA expression levels of 62 839 probesets, one of the largest sets analyzed to date, in 12 representative normal human tissues. Indeed, using a newly defined graded tissue specificity index τ, valued between 0 for housekeeping genes and 1 for tissue-specific genes, genes with midrange profiles (0.15 < τ < 0.85) were found to constitute >50% of all expression patterns. We developed a binary classification, indicating for every gene the IB tissues in which it is overly expressed and the 12 − IB tissues in which it shows low expression. The 85 dominant midrange patterns with IB = 2–11 were found to be bimodally distributed and to contribute most significantly to the definition of tissue specification dendrograms. Our analyses provide a novel route to infer expression profiles for presumed ancestral nodes in the tissue dendrogram. Such a definition has uncovered an unsuspected correlation, whereby de novo enhancement and diminution of gene expression go hand in hand. These findings highlight the importance of gene suppression events, with implications for the course of tissue specification in ontogeny and phylogeny.

Availability: All data and analyses are publicly available at the GeneNote website, http://genecards.weizmann.ac.il/genenote/, and under GEO accession GSE803.

Contact: [email protected]

Supplementary information: Four tables are available at the above site.
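The abstract describes τ only by its range (0 for housekeeping, 1 for tissue-specific) and does not spell out a formula. The sketch below uses the form commonly cited for this index, τ = Σ(1 − x_i / x_max) / (N − 1), where x_i is a gene's expression in tissue i and N is the number of tissues. It assumes non-negative expression values. The `binary_pattern` helper and its `fraction` threshold are hypothetical, included only to illustrate how a count like IB could be read off a profile; they are not the classification procedure used in the paper.

```python
def tau(expression):
    """Tissue specificity index for one gene's expression profile.

    Returns 0.0 for a perfectly uniform profile and 1.0 when the gene
    is expressed in exactly one tissue. Assumes non-negative values.
    """
    n = len(expression)
    if n < 2:
        raise ValueError("need at least two tissues")
    x_max = max(expression)
    if x_max <= 0:
        raise ValueError("profile has no positive expression value")
    return sum(1 - x / x_max for x in expression) / (n - 1)


def binary_pattern(expression, fraction=0.5):
    """Hypothetical binarization: 1 where expression is at least
    `fraction` of the profile maximum, else 0. The threshold is an
    illustrative assumption, not taken from the paper."""
    x_max = max(expression)
    return [1 if x >= fraction * x_max else 0 for x in expression]


# Three toy profiles over 12 tissues (made-up numbers).
housekeeping = [100.0] * 12
tissue_specific = [100.0] + [0.0] * 11
midrange = [100.0] * 4 + [10.0] * 8

for name, profile in [("housekeeping", housekeeping),
                      ("tissue_specific", tissue_specific),
                      ("midrange", midrange)]:
    pattern = binary_pattern(profile)
    print(name, round(tau(profile), 3), "IB =", sum(pattern))
```

With these toy profiles the midrange gene falls inside the 0.15 < τ < 0.85 window and has four tissues marked as high, which is the kind of pattern the abstract groups under IB = 2–11.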
