Placing search in context

ACM Transactions on Information Systems - Tập 20 Số 1 - Trang 116-131 - 2002

Tóm tắt

Keyword-based search engines are in widespread use today as a popular means for Web-based information retrieval. Although such systems seem deceptively simple, a considerable amount of skill is required in order to satisfy non-trivial information needs. This paper presents a new conceptual paradigm for performing search in context, that largely automates the search process, providing even non-professional users with highly relevant results. This paradigm is implemented in practice in the IntelliZap system, where search is initiated from a text query marked by the user in a document she views, and is guided by the text surrounding the marked query in that document ("the context"). The context-driven information retrieval process involves semantic keyword extraction and clustering to automatically generate new, augmented queries. The latter are submitted to a host of general and domain-specific search engines. Search results are then semantically reranked, using context. Experimental results testify that using context to guide search, effectively offers even inexperienced users an advanced search tool on the Web.

Từ khóa


Tài liệu tham khảo

BHARAT K. 2000. SearchPad: Explicit capture of search context to support web search. In Proceedings of the 9th International World Wide Web Conference WWW9 (Amsterdam May).

BUDZIK J.AND HAMMOND K. J. 2000. User interactions with everyday applications as context for just-in-time information access. In Proceedings of the 2000 International Conference on Intelligent User Interfaces (New Orleans Louisiana) ACM Press New York NY 44-51. 10.1145/325737.325776

BUTLER D. 2000. Souped-up search engines. Nature Vol. 405 (May) 112-115.

DUDA R.O.AND HART P. E. 1973. Pattern Classification and Scene Analysis. John Wiley and Sons New York NY.

EUROWORDNET. http://www.hum.uva.nl/cewn/

FELLBAUM C. ED. 1998. WordNet An Electronic Lexical Database. MIT Press The WordNet database is available online at http://www.cogsci.princeton.edu/edn.

FUKUNAGA K. 1990. Introduction to Statistical Pattern Recognition. Academic Press San Diego CA.

GLOVER E.ET AL. 1999. Architecture of a meta search engine that supports user information needs. In Proceedings of the 8th International Conference on Information and Knowledge Management CIKM 99 (Kansas City MO November) 210-216. 10.1145/319950.319980

GOOGLE. The basics of Google search. http://www.google.com/help/basics.html

LANDAUER T. K. FOLTZ P.W. AND LAHAM D. 1998. Introduction to Latent Semantic Analysis. Dis. Proc. 25 2 & 3 259-284.

LAWRENCE S. 2000. Context in web search. Data Engineering IEEE Comput. Soc 23 3 (September) 25-32. http://www.research.microsoft.com/research/db/debull/A00sept/lawrence.ps

LEXISNEXIS. 2001. LexisNexis delivers Smart Tags for Microsoft Office XP http://www.lexisnexis. com/lncc/about/newsreleases/0412.html.

MICROSOFT OFFICE XP. 2001. At a glance: Smart Tags http://www.microsoft.com/Partner/ BusinessDevelopment/SalesResources/factsheets/SmartTags.asp.

MILLER G. A. AND CHARLES W. G. 1991. Contextual correlates of semantic similarity. Lang. Cog. Pro. 6 1 1-28.

OPHER I. HORN D. AND QUENET B. 1999. Clustering with Spiking Neurons. In Proceedings of the International Conference on Artificial Neural Networks (ICANN' 99 Edinburgh Scotland September) 485-490.

PRESS W. H. TEUKOLSKY S. A. VETTERLING W.T. AND FLANNERY B. P. 1992. Numerical Recipes in C: The Art of Scientific Computing 2nd ed. Cambridge University Press Section 14.3 620-628.

RESNIK P. 1999. Semantic similarity in a taxonomy: An information-based measure and its application to problems of ambiguity in natural language. J. Art. Intell. Re. 11 95-130.

SHERMAN C. 2000a. Inktomi inside. http://websearch.about.com/internet/websearch/library/ weekly/aa041900a.htm.

SHERMAN C. 2000b. Link building strategies. http://websearch.about.com/internet/websearch/ library/weekly/aa082300a.htm.

SULLIVAN D. 2000. Numbers numbersbut what do they mean? The Search Engine Report (March 3). http://searchenginewatch.com/sereport/00/03-numbers.html.

XU J.AND CROFT W. B. 2000. Improving the effectiveness of information retrieval with local context analysis. ACM TOIS 18 1 (January) 79-112. 10.1145/333135.333138

YOKOI T. 1995. The EDR electronic dictionary. Comm. ACM 38 11 (November) ACM Press New York NY 42-44. 10.1145/219717.219752