Meta-clustering of gene expression data and literature-based information
Tóm tắt
The current tendency in the life sciences to spawn ever growing amounts of high-throughput assays has led to a situation where the interpretation of data and the formulation of hypotheses lag the pace at which information is produced. Although the first generation of statistical algorithms scrutinizing single, large-scale data sets found their way into the biological community, the great challenge to connect their results to existing knowledge still remains. Despite the fairly large number of biological databases that is currently available, a lot of relevant information is found in free-text format (such as textual annotations, scientific abstracts and full publications). In this paper we explore how an
Từ khóa
Tài liệu tham khảo
R. Baeza-Yates and B. Ribeiro-Neto . Modern Information Retrieval . ACM Press , 1999 . R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. ACM Press, 1999.
A. Ben-Hur , A. Elisseeff , and I. Guyon . A stability based method for discovering structure in clustered data . In Proc of the Seventh Ann Pac Symp Biocomp (PSB 2002 ), pages 6 -- 17 , 2002 . A. Ben-Hur, A. Elisseeff, and I. Guyon. A stability based method for discovering structure in clustered data. In Proc of the Seventh Ann Pac Symp Biocomp (PSB 2002), pages 6--17, 2002.
D. Chaussable and A. Cher . Mining microarray expression data by literature profiling . Genome Biol , 3 , 2002 . D. Chaussable and A. Cher. Mining microarray expression data by literature profiling. Genome Biol, 3, 2002.
W. B. Frakes . Stemming algorithms . in W. B. Frakes and R. Baeze-Yates: Information retrieval . Prentice Hall , 1992 . W. B. Frakes. Stemming algorithms. in W. B. Frakes and R. Baeze-Yates: Information retrieval. Prentice Hall, 1992.
P. Glenisson , P. Antal , J. Mathys , and B. De Moor . Evaluation of the vector space representation in text-based gene clustering . In Proc of the Eighth Ann Pac Symp Biocomp (PSB 2003 ), pages 391 -- 402 , 2003 . P. Glenisson, P. Antal, J. Mathys, and B. De Moor. Evaluation of the vector space representation in text-based gene clustering. In Proc of the Eighth Ann Pac Symp Biocomp (PSB 2003), pages 391--402, 2003.
P. Glenisson , B. Coessens , S. Van Vooren , Y. Moreau , and B. De Moor . Text-based gene profiling with domain-specific views . In Proc of the First Int Workshop on Semantic Web and Databases (SWDB 2003 ), Berling, Germany, pages 15--31 , 2003 . P. Glenisson, B. Coessens, S. Van Vooren, Y. Moreau, and B. De Moor. Text-based gene profiling with domain-specific views. In Proc of the First Int Workshop on Semantic Web and Databases (SWDB 2003), Berling, Germany, pages 15--31, 2003.
A. Jain and R. Dubes . Algorithms for clustering data . Prentice Hall , 1988 . A. Jain and R. Dubes. Algorithms for clustering data. Prentice Hall, 1988.
P. Pavlidis , D. Lewis , and W. Noble . Exploring gene expression data with class scores . In Proc of the Seventh Ann Pac Symp Biocomp (PSB 2002) , 2002 . P. Pavlidis, D. Lewis, and W. Noble. Exploring gene expression data with class scores. In Proc of the Seventh Ann Pac Symp Biocomp (PSB 2002), 2002.
K. Pollard and M. van der Laan. A method to identify significant clusters in gene expression data. In To appear in Proc of Systemics , Cybernetics and Informatics 2002 (SCI 2002) , 2002 . K. Pollard and M. van der Laan. A method to identify significant clusters in gene expression data. In To appear in Proc of Systemics, Cybernetics and Informatics 2002 (SCI 2002), 2002.
M. Stephens , M. Palakal , S. Mukhopadhyay , R. Raje , and J. Mostafa . Detecting gene relations from MEDLINE abstracts . In Proc of the Sixth Ann Pac Symp Biocomp (PSB 2001) , 2001 . M. Stephens, M. Palakal, S. Mukhopadhyay, R. Raje, and J. Mostafa. Detecting gene relations from MEDLINE abstracts. In Proc of the Sixth Ann Pac Symp Biocomp (PSB 2001), 2001.