Protein names and how to find them
Tài liệu tham khảo
F. Olsson, P. Hansen, K. Franzén, J. Karlgren. Information access and refinement — a research theme, Ercim News 46 (2001).
Grishman, 1997, Information extraction: techniques and challenges, 10
Proceedings of the Seventh Message Understanding Conference (MUC-7), Morgan Kaufmann, Virginia USA, April–May 1998.
Proceedings of the Sixth Message Understanding Conference (MUC-6), Morgan Kaufmann, Columbia, MD USA, November 1995.
Proceedings of the Fifth Message Understanding Conference (MUC-5), Morgan Kaufmann, Baltimore, MD, USA, August 1993.
Proceedings of the Fourth Message Understanding Conference (MUC-4), Morgan Kaufmann, June 1992.
Proceedings of the Third Message Understanding Conference (MUC-3), Morgan Kaufmann, May 1991.
A. Borthwick, J. Sterling, E. Agichtein, R. Grishman, Exploiting diverse knowledge sources via maximum entropy in named entity recognition, in: Proceedings of the Sixth Workshop on Very Large Corpora, Montreal, Canada, August 1998.
C. Nobata, N. Collier, J. Tsujii, Automatic term identification and classification in biology texts, in: Proceedings of the Natural Language Pacific Rim Symposium (NLPRS'2000), November 1999, pp. 369–374.
N. Collier, C. Nobata, J. Tsujii, Extracting the name of genes and gene products with a Hidden Markov Model, in: Proceedings of the 18th International Conference on Computational Linguistics (COLING-2000), August 2000, pp. 201–207.
K. Fukuda, T. Tsunoda, A. Tamura, T. Takagi, Toward Information extraction: identifying protein names from biological papers, in: Proceedings of the Pacific Symposium on Biocamputing (PSB'98), Maui, Hawaii, 4–9 January 1998, pp. 705–716.
R. Gaizauskas, K. Humphreys, G. Demetriou, Information extraction from biological science journal articles: enzyme interactions and protein structures, in: M.G. Hicks (Ed.), Proceedings of the Workshop Chemical Data Analysis in the Large: the Challenge of the Automation Age, 2001.
Bairoch, 2000, The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000, Nucl. Acids Res., 28, 45, 10.1093/nar/28.1.45
P. Tapanainen, T. Järvinen, A non-projective dependency parser, In: Proceedings of the fifth Conference on Applied Natural Language Processing, Association for Computational Linguistics, Washington DC, April 1997, pp. 64–71.
B. de Bruijn, J. Martin, Protein name tagging, Presented as a poster at the eighth International Conference on Intelligent Systems for Molecular Biology (ISMB'00), 2000.
N. Collier, H.S. Park, N. Ogata, Y. Tateishi, C. Nobata, T. Ohta, T. Sekimizu, H. Imai, K. Ibushi, J. Tsujii, The GENIA project: corpus-based knowledge acquisition and information extraction from genome research papers, In: Proceedings of the ninth Conference of the European Chapter of the Association for Computational Linguistics (EACL), June 1999, pp. 271–272.