Word sense disambiguation as a traveling salesman problem

Artificial Intelligence Review - Tập 40 - Trang 405-427 - 2011

Kiem-Hieu Nguyen¹, Cheol-Young Ock¹

¹School of Electrical Engineering, University of Ulsan, Ulsan, Korea

Tóm tắt

Word sense disambiguation (WSD) is a difficult problem in Computational Linguistics, mostly because of the use of a fixed sense inventory and the deep level of granularity. This paper formulates WSD as a variant of the traveling salesman problem (TSP) to maximize the overall semantic relatedness of the context to be disambiguated. Ant colony optimization, a robust nature-inspired algorithm, was used in a reinforcement learning manner to solve the formulated TSP. We propose a novel measure based on the Lesk algorithm and Vector Space Model to calculate semantic relatedness. Our approach to WSD is comparable to state-of-the-art knowledge-based and unsupervised methods for benchmark datasets. In addition, we show that the combination of knowledge-based methods is superior to the most frequent sense heuristic and significantly reduces the difference between knowledge-based and supervised methods. The proposed approach could be customized for other lexical disambiguation tasks, such as Lexical Substitution or Word Domain Disambiguation.

Tài liệu tham khảo

Agirre E, Soroa A (2009) Personalizing PageRank for word sense disambiguation. In: Proceedings of the 12th conference of the European chapter of the ACL (EACL 2009). Association for Computational Linguistics, Athens, pp 33–41 Agirre E et al (2010) SemEval-2010 task 17: all-words word sense disambiguation on a specific domain. In: Proceedings of the 5th international workshop on semantic evaluation. Association for Computational Linguistics, Uppsala, pp 75–80 Banerjee S, Pedersen T (2002) An adapted lesk algorithm for word sense disambiguation using WordNet. In: Proceedings of the 3rd international conference on computational linguistics and intelligent text processing. CICLing ’02. Springer, London, pp 136–145 Brin S, Page L (1998) The anatomy of a large-scale hypertextual web search engine. Comput Netw ISDN Syst 30(1–7): 107–117 Buckley C (1985) Implementation of the SMART information retrieval system. Cornell University, Ithaca Budanitsky A, Hirst G (2006) Evaluating WordNet-based measures of lexical semantic relatedness. Comput Linguist 32(1): 13–47 Chan YS, Ng HT, Zhong Z (2007) NUS-PT: exploiting parallel texts for word sense disambiguation in the English all-words tasks. In: Proceedings of the 4th international workshop on semantic evaluations (SemEval-2007). Association for Computational Linguistics, Prague, pp 253–256 Cottrell GW (1989) A connectionist approach to word sense disambiguation. Morgan Kaufmann Publishers Inc, San Francisco Cowie J, Guthrie J, Guthrie L (1992) Lexical disambiguation using simulated annealing. In: Proceedings of the workshop on speech and natural language. Association for Computational Linguistics, HLT ’91. Stroudsburg, pp 238–242 Decadt B et al (2004) GAMBL, genetic algorithm optimization of memory-based WSD. In: Mihalcea R, Edmonds P (eds) Senseval-3: 3rd international workshop on the evaluation of systems for the semantic analysis of text. Association for Computational Linguistics, Barcelona, pp 108–112 Dorigo M, Gambardella LM (1997) Ant colonies for the traveling salesman problem Edmonds P, Cotton S (2002) SENSEVAL-2: overview. In: Proceedings of SENSEVAL-2: 2nd international workshop on evaluating word sense disambiguation systems, 5–6 July 2001, Toulouse, pp 1–7 Gelbukh AF, Sidorov G, Han S-Y (2005) On some optimization heuristics for lesk-like WSD algorithms. In: NLDB, pp 402–405 Hatori J, Miyao Y, Tsujii J (2008) Word sense disambiguation for all words using tree-structured conditional random fields. In: Coling 2008: companion volume: posters. Coling 2008. Organizing Committee, Manchester, pp 43–46 Hirst GJ (1984) Semantic interpretation against ambiguity. Brown University, Providence Ide N, Véronis J (1998) Introduction to the special issue on word sense disambiguation: the state of the art. Comput Linguist 24(1): 2–40 Jiang JJ, Conrath DW (1997) Semantic similarity based on corpus statistics and lexical taxonomy. In: International conference research on computational linguistics (ROCLING X), pp 19–33 Kleinberg JM (1999) Hubs, authorities, and communities. ACM Comput Surv 31(4es):21–23 Laporte G (1992) The traveling salesman problem: an overview of exact and approximate algorithms. Eur J Oper Res 59(2): 231–247 Leacock C, Chodorow M (1998) Combining local context and WordNet similarity for word sense identification. In: Fellbaum C (ed) WordNet: an electronic lexical database. MIT Press, Cambridge, pp 305–332 Lee YK, Ng HT, Chia TK (2004) Supervised word sense disambiguation with support vector machines and multiple knowledge sources. In: Mihalcea R, Edmonds P (eds) Senseval-3: 3rd international workshop on the evaluation of systems for the semantic analysis of text. Association for Computational Linguistics, Barcelona, pp 137–140 Lesk M (1986) Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In: Proceedings of the 5th annual international conference on systems documentation. SIGDOC ’86. ACM, New York, pp 24–26 Lin D (1998) An information-theoretic definition of similarity. In: Proceedings of the 15th international conference on machine learning. ICML ’98. Morgan Kaufmann Publishers Inc, San Francisco, pp 296–304 Mihalcea R, Moldovan DI (2001) eXtended WordNet: progress report. In: Proceedings of NAACL workshop on WordNet and other lexical resources, pp 95–100 Miller GA (1995) WordNet: a lexical database for English. Commun ACM 38(11): 39–41 Miller GA, Charles WG (1991) Contextual correlates of semantic similarity. Lang Cogn Process 6: 1–28 Miller GA et al (1993) A semantic concordance. In: Proceedings of the workshop on human language technology. HLT ’93. Association for Computational Linguistics, Stroudsburg, pp 303–308 Molina A, Pla F, Segarra E (2002) A hidden Markov model approach to word sense disambiguation. In: Proceedings of the 8th Ibero-American conference on AI: advances in artificial intelligence. IBERAMIA. Springer, London, pp 655–663 Navigli R (2009) Word sense disambiguation: a survey. ACM Comput Surv 41(2): 10–11069 Navigli R, Velardi P (2005) Structural semantic interconnections: a knowledge-based approach to word sense disambiguation. IEEE Trans Pattern Anal Mach Intell 27(7): 1075–1086 Navigli R, Litkowski KC, Hargraves O (2007) SemEval-2007 task 07: coarse-grained English all-words task. In: Proceedings of the 4th international workshop on semantic evaluations (SemEval-2007). Association for Computational Linguistics, Prague, pp 30–35 Page L et al (1999) The PageRank citation ranking: bringing order to the web. Stanford InfoLab Pedersen T, Banerjee S, Patwardhan S (2005) Maximizing semantic relatedness to perform word sense disambiguation Pedersen T, Bruce R (1997) Distinguishing word senses in untagged text. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP-97), Providence Ponzetto SP, Navigli R (2010) Knowledge-rich word sense disambiguation rivaling supervised systems. In: Proceedings of the 48th annual meeting of the association for computational linguistics. Association for Computational Linguistics, Uppsala, pp 1522–1531 Ponzetto SP, Strube M (2011) Taxonomy induction based on a collaboratively built knowledge repository. Artif Intell 175(9–10): 1737–1756 Popescu M, Hristea F (2011) State of the art versus classical clustering for unsupervised word sense disambiguation. Artif Intell Rev 35(3): 241–264 Resnik P (1995) Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the 14th international joint conference on artificial intelligence—volume 1. Morgan Kaufmann Publishers Inc, San Francisco, pp 448–453 Ruiz-Casado M, Alfonseca E, Castells P (2005) Automatic assignment of wikipedia encyclopedic entries to WordNet Synsets. In: Advances in web intelligence, pp 380–386 Schwab D, Guillaume N (2011) A global ant colony algorithm for word sense disambiguation based on semantic relatedness. In: Pérez J et al (ed) Highlights in practical applications of agents and multiagent systems. Advances in intelligent and soft computing. Springer, Berlin, pp 257–264 Segond F et al (1997) An experiment in semantic tagging using hidden Markov model tagging. In: ACL/EACL workshop on automatic information extraction and building of lexical semantic resources for NLP applications, pp 78–81 Sinha R, Mihalcea R (2007) Unsupervised graph-based word sense disambiguation using measures of word semantic similarity. In: Proceedings of the international conference on semantic computing. IEEE Computer Society, Washington, pp 363–369 Snyder B, Palmer M (2004) The English all-words task. In: Mihalcea R, Edmonds P (eds) Senseval-3: 3rd international workshop on the evaluation of systems for the semantic analysis of text. Association for Computational Linguistics, Barcelona, pp 41–43 Tran A et al (2010) TreeMatch: a fully unsupervised WSD system using dependency knowledge on a specific domain. In: Proceedings of the 5th international workshop on semantic evaluation. SemEval ’10. Association for Computational Linguistics, Stroudsburg, pp 396–401 Tsatsaronis G, Vazirgiannis M, Androutsopoulos I (2007) Word sense disambiguation with spreading activation networks generated from thesauri. In: Proceedings of the 20th international joint conference on artifical intelligence. Morgan Kaufmann Publishers Inc, San Francisco, pp 1725–1730 Vasilescu F, Langlais P, Lapalme G (2004) Evaluating variants of the lesk approach for disambiguating words. In: Proceedings of LREC 2004. Lisbonne, pp 633–636 Veronis J, Ide NM (1990) Word sense disambiguation with very large neural networks extracted from machine readable dictionaries. In: Proceedings of the 13th conference on computational linguistics—volume 2. COLING ’90. Association for Computational Linguistics, Morristown, pp 389–394 Wu Z, Palmer M (1994) Verbs semantics and lexical selection. In: Proceedings of the 32nd annual meeting on Association for Computational Linguistics. ACL ’94. Association for Computational Linguistics, Stroudsburg, pp 133–138 Yepes AJ, Aronson A (2010) Knowledge-based biomedical word sense disambiguation: comparison of approaches. BMC Bioinformatics 11(1): 569+ Yuret D, Yatbaz MA (2010) The noisy channel model for unsupervised word sense disambiguation. Comput Linguist 36(1): 111–127 Zhang C, Zhou Y, Martin T (2008) Genetic word sense disambiguation algorithm. In: Proceedings of the 2008 2nd international symposium on intelligent information technology application—volume 01. IITA ’08. IEEE Computer Society, Washington, pp 123–127

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Về chúng tôi

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích các bài báo, công bố khoa học Việt Nam. Công cụ trợ giúp người nghiên cứu, tạp chí, đơn vị nghiên cứu tra cứu, phân tích và thống kê dữ liệu nghiên cứu khoa học tại Việt Nam và quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia vào Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Công cụ kiểm tra chính tả và thể thức Viver

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA