Information retrieval on the web

ACM Computing Surveys - Tập 32 Số 2 - Trang 144-173 - 2000
Mei Kobayashi1, Koichi Takeda1
1IBM Research, Kanagawa-ken, Japan

Tóm tắt

In this paper we review studies of the growth of the Internet and technologies that are useful for information search and retrieval on the Web. We present data on the Internet from several different sources, e.g., current as well as projected number of users, hosts, and Web sites. Although numerical figures vary, overall trends cited by the sources are consistent and point to exponential growth in the past and in the coming decade. Hence it is not surprising that about 85% of Internet users surveyed claim using search engines and search services to find specific information. The same surveys show, however, that users are not satisfied with the performance of the current generation of search engines; the slow retrieval speed, communication delays, and poor quality of retrieved results (e.g., noise and broken links) are commonly cited problems. We discuss the development of new techniques targeted to resolve some of the problems associated with Web-based information retrieval and speculate on future trends.

Từ khóa


Tài liệu tham khảo

ASSOCIATION FOR COMPUTING MACHINERY. 2000 . SIGCHI: Special Interest Group on Computer- Human Interaction. Home page: www.acm. org/sigchi/]] ASSOCIATION FOR COMPUTING MACHINERY. 2000. SIGCHI: Special Interest Group on Computer- Human Interaction. Home page: www.acm. org/sigchi/]]

ASSOCIATION FOR COMPUTING MACHINERY. 2000. SI- GIR: Special Interest Group on Information Retrieval. Home page: www.acm.org/sigir/]] ASSOCIATION FOR COMPUTING MACHINERY. 2000. SI- GIR: Special Interest Group on Information Retrieval. Home page: www.acm.org/sigir/]]

AGOSTI , M. AND SMEATON , A. 1996. Information Retrieval and Hypertext . Kluwer Academic Publishers , Hingham, MA .]] AGOSTI,M.AND SMEATON, A. 1996. Information Retrieval and Hypertext. Kluwer Academic Publishers, Hingham, MA.]]

10.1145/276304.276314

10.1145/191666.191775

AHLBERG , C. AND SHNEIDERMAN , B. 1997 . The alphaslider: A compact and rapid and selector . In Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI '97 , Atlanta, GA, Mar. 22-27), S. Pemberton, Ed. ACM Press, New York, NY.]] AHLBERG,C.AND SHNEIDERMAN, B. 1997. The alphaslider: A compact and rapid and selector. In Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI '97, Atlanta, GA, Mar. 22-27), S. Pemberton, Ed. ACM Press, New York, NY.]]

AI MAG . 1997 . Special issue on intelligent systems on the internet . AI Mag. 18 , 4 .]] AI MAG. 1997. Special issue on intelligent systems on the internet. AI Mag. 18,4.]]

ANDERBERG , M. R. 1973. Cluster Analysis for Applications . Academic Press, Inc. , New York, NY .]] ANDERBERG, M. R. 1973. Cluster Analysis for Applications. Academic Press, Inc., New York, NY.]]

10.1145/278459.258601

10.1109/69.234774

BAEZA-YATES , R. A. 1992 . Introduction to data structures and algorithms related to information retrieval. In Information Retrieval: Data Structures and Algorithms, W. B. Frakes and R. Baeza-Yates, Eds. Prentice-Hall, Inc., Upper Saddle River , NJ , 13 - 27 .]] BAEZA-YATES, R. A. 1992. Introduction to data structures and algorithms related to information retrieval. In Information Retrieval: Data Structures and Algorithms, W. B. Frakes and R. Baeza-Yates, Eds. Prentice-Hall, Inc., Upper Saddle River, NJ, 13-27.]]

BAEZA-YATES , R. AND RIBEIRO-NETO , B. 1999. Modern Information Retrieval . Addison-Wesley , Reading, MA .]] BAEZA-YATES,R.AND RIBEIRO-NETO, B. 1999. Modern Information Retrieval. Addison-Wesley, Reading, MA.]]

10.1145/267658.267744

BALABANOVIC , M. AND SHOHAM , Y. 1995 . Learning information retrieval agents: Experiments with automated web browsing . In Proceedings of the 1995 AAAI Spring Symposium on Information Gathering from Heterogenous Distributed Environments ( Stanford, CA, Mar.). AAAI Press, Menlo Park, CA.]] BALABANOVIC,M.AND SHOHAM, Y. 1995. Learning information retrieval agents: Experiments with automated web browsing. In Proceedings of the 1995 AAAI Spring Symposium on Information Gathering from Heterogenous Distributed Environments (Stanford, CA, Mar.). AAAI Press, Menlo Park, CA.]]

BALABANOVIC , M. , SHOHAM , Y. , AND YUN , T. 1995. An adaptive agent for automated web browsing. Stanford Univ. Digital Libraries Project, working paper 1995-0023 . Stanford University , Stanford, CA .]] BALABANOVIC, M., SHOHAM, Y., AND YUN, T. 1995. An adaptive agent for automated web browsing. Stanford Univ. Digital Libraries Project, working paper 1995-0023. Stanford University, Stanford, CA.]]

10.1145/258549.258563

10.1145/290941.290958

BARZILAI AND DAVIDSON. 1997. Computer-based electronic bid auction and sale system and a system to teach new/non-registered customers how bidding auction purchasing works: U.S. Patent no. 60112045.]] BARZILAI AND DAVIDSON. 1997. Computer-based electronic bid auction and sale system and a system to teach new/non-registered customers how bidding auction purchasing works: U.S. Patent no. 60112045.]]

BEAUDOIN , L. , PARENT , M.-A. , AND VROOMEN , L.C. 1996 . Cheops: A compact explorer for complex hierarchies. In Proceedings of the IEEE Conference on Visualization (San Francisco, CA, Oct. 27-Nov. 1) , R. Yagel and G. M. Nielson, Eds. IEEE Computer Society Press , Los Alamitos, CA , 87ff.]] BEAUDOIN, L., PARENT, M.-A., AND VROOMEN,L.C. 1996. Cheops: A compact explorer for complex hierarchies. In Proceedings of the IEEE Conference on Visualization (San Francisco, CA, Oct. 27-Nov. 1), R. Yagel and G. M. Nielson, Eds. IEEE Computer Society Press, Los Alamitos, CA, 87ff.]]

10.1145/192426.192435

BERENT T. HURST D. PATTON T. TABERNIK T. REIG J.W.D. AND WHITTLE W. 1998. Electronic on-line motor vehicle auction and information system: U.S. Patent no. 5774873.]] BERENT T. HURST D. PATTON T. TABERNIK T. REIG J.W.D. AND WHITTLE W. 1998. Electronic on-line motor vehicle auction and information system: U.S. Patent no. 5774873.]]

10.1145/179606.179671

BERRY , M. AND BROWN , M. 1999. Understanding Search Engines . SIAM, Philadelphia , PA .]] BERRY,M.AND BROWN, M. 1999. Understanding Search Engines. SIAM, Philadelphia, PA.]]

10.1137/1037127

10.5555/297805.297863

10.1147/rd.422.0233

BORKO , H. 1979 . Inter-indexer consistency . In Proceedings of the Cranfield Conference.]] BORKO, H. 1979. Inter-indexer consistency. In Proceedings of the Cranfield Conference.]]

10.1145/146802.146826

BRAKE , D. 1997 . Lost in cyberspace. New Sci. Mag. www.newscientist.com/keysites/networld/ lost.html]] BRAKE, D. 1997. Lost in cyberspace. New Sci. Mag. www.newscientist.com/keysites/networld/ lost.html]]

10.1016/S0169-7552(98)00110-X

10.1016/S0169-7552(97)00031-7

BUSINESS WEEK. 1997. Special report on speech technologies. Business Week.]] BUSINESS WEEK. 1997. Special report on speech technologies. Business Week.]]

COMMUNICATIONS OF THE ACM. 1993. Special issue on the next generation GUIs. Commun. ACM.]] COMMUNICATIONS OF THE ACM. 1993. Special issue on the next generation GUIs. Commun. ACM.]]

COMMUNICATIONS OF THE ACM. 1994. Special issue on internet technology. Commun. ACM.]] COMMUNICATIONS OF THE ACM. 1994. Special issue on internet technology. Commun. ACM.]]

COMMUNICATIONS OF THE ACM. 1995. Special issues on digital libraries. Commun. ACM.]] COMMUNICATIONS OF THE ACM. 1995. Special issues on digital libraries. Commun. ACM.]]

COMMUNICATIONS OF THE ACM. 1999. Special issues on knowledge discovery. Commun. ACM.]] COMMUNICATIONS OF THE ACM. 1999. Special issues on knowledge discovery. Commun. ACM.]]

CARD , S. , MACKINLAY , J. , AND SHNEIDERMAN , B. 1999. Readings in Information Visualization: Using Vision to Think . Morgan Kaufmann Publishers Inc ., San Francisco, CA.]] CARD, S., MACKINLAY, J., AND SHNEIDERMAN,B. 1999. Readings in Information Visualization: Using Vision to Think. Morgan Kaufmann Publishers Inc., San Francisco, CA.]]

10.1145/238386.238446

CARL , J. 1995 . Protocol gives sites way to keep out the 'bots '. Web Week 1 , 7 (Nov.).]] CARL, J. 1995. Protocol gives sites way to keep out the 'bots'. Web Week 1, 7 (Nov.).]]

CARRI~RE , J. AND KAZMAN , R. 1997 . WebQuery: Searching and visualizing the Web through connectivity . In Proceedings of the Sixth International Conference on the World Wide Web (Santa Clara CA, Apr.).]] CARRI~RE,J.AND KAZMAN, R. 1997. WebQuery: Searching and visualizing the Web through connectivity. In Proceedings of the Sixth International Conference on the World Wide Web (Santa Clara CA, Apr.).]]

10.1147/rd.422.0253

CATHRO , W. 1997 . Matching discovery and recovery . In Proceedings of the Seminar on Standards Australia. www.nla.gov.au/staffpaper/cathro3.html]] CATHRO, W. 1997. Matching discovery and recovery. In Proceedings of the Seminar on Standards Australia. www.nla.gov.au/staffpaper/cathro3.html]]

CHAKRABARTI , S. , DOM , B. , GIBSON , D. , KUMAR , S. , RAGHAVAN , P. , RAJAGOPALAN ,, S., AND TOMKINS , A. 1988 . Experiments in topic distillation . In Proceedings of the ACM SIGIR Workshop on Hypertext Information Retrieval for the Web (Apr.). ACM Press , New York, NY.]] CHAKRABARTI, S., DOM, B., GIBSON, D., KUMAR, S., RAGHAVAN, P., RAJAGOPALAN,, S., AND TOMKINS, A. 1988. Experiments in topic distillation. In Proceedings of the ACM SIGIR Workshop on Hypertext Information Retrieval for the Web (Apr.). ACM Press, New York, NY.]]

10.5555/297805.297821

CHAKRABARTI S.AND RAJAGOPALAN S. 1997. Survey of information retrieval research and products. Home page: w3.almaden.ibm.com/ soumen/ir.html]] CHAKRABARTI S.AND RAJAGOPALAN S. 1997. Survey of information retrieval research and products. Home page: w3.almaden.ibm.com/ soumen/ir.html]]

10.1145/133160.133215

CHANDRASEKARAN R. 1998. "Portals" offer one-stop surfing on the net. Int. Herald Tribune 19/21.]] CHANDRASEKARAN R. 1998. "Portals" offer one-stop surfing on the net. Int. Herald Tribune 19/21.]]

CHANG , S.-F. 1995 . Compressed domain techniques for image/video indexing and manipulation . In Proceedings of the Conference on Information Processing.]] CHANG, S.-F. 1995. Compressed domain techniques for image/video indexing and manipulation. In Proceedings of the Conference on Information Processing.]]

10.1016/S0169-7552(98)00108-1

CLARK , D. 2000. Shopbots become agents for business change . IEEE Computer .]] CLARK, D. 2000. Shopbots become agents for business change. IEEE Computer.]]

10.1108/eb026487

COHEN A. 1999. The attic of e. Time Mag.]] COHEN A. 1999. The attic of e. Time Mag.]]

COMPUT . NETW . ISDN SYST . 2000 . World Wide Web conferences. 1995-2000 . Comput. Netw. ISDN Syst. www.w3.org/Conferences/Overview- WWW.html]] COMPUT.NETW. ISDN SYST. 2000. World Wide Web conferences. 1995-2000. Comput. Netw. ISDN Syst. www.w3.org/Conferences/Overview- WWW.html]]

10.1002/asi.4630200314

10.1145/280324.280336

10.1145/299917.299920

CUNNINGHAM M. 1997. Brewster's millions. Irish Times.www.irish-times.com/irish-times/paper/ 1997/0127/cmp1.html]] CUNNINGHAM M. 1997. Brewster's millions. Irish Times.www.irish-times.com/irish-times/paper/ 1997/0127/cmp1.html]]

10.1145/160688.160706

10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9

DEMMEL , J. W. 1997. Applied Numerical Linear Algebra . SIAM , Philadelphia, PA .]] DEMMEL, J. W. 1997. Applied Numerical Linear Algebra. SIAM, Philadelphia, PA.]]

DHILLON , I. AND MOHDA , D. 1999 . A data-clustering algorithm on distributed memory multiprocessors . In Proceedings of the Workshop on Large-Scale Parallel KDD Systems (ACM SIGKDD., Aug. 15-18) . ACM Press, New York, NY.]] DHILLON,I.AND MOHDA, D. 1999. A data-clustering algorithm on distributed memory multiprocessors. In Proceedings of the Workshop on Large-Scale Parallel KDD Systems (ACM SIGKDD., Aug. 15-18). ACM Press, New York, NY.]]

DHILLON I.AND MODHA D. 2000. Concept decompositions for large sparse text data using clustering. Mach. Learn.]] DHILLON I.AND MODHA D. 2000. Concept decompositions for large sparse text data using clustering. Mach. Learn.]]

ECHIGO , T. , KUROKAWA , M. , TOMITA , A. , TOMITA , A. , MIYAMORI , AND II SAKU , S. 2000 . Video enrichment: Retrieval and enhanced visualization based on behaviors of objects . In Proceedings of the Fourth Asian Conference on Computer Vision (ACCV2000 , Jan. 8-11 ). 364-369.]] ECHIGO, T., KUROKAWA, M., TOMITA, A., TOMITA, A., MIYAMORI, AND IISAKU, S. 2000. Video enrichment: Retrieval and enhanced visualization based on behaviors of objects. In Proceedings of the Fourth Asian Conference on Computer Vision (ACCV2000, Jan. 8-11). 364-369.]]

10.1145/290941.290959

ESTER , M. , KRIEGEL , H.-S. , SANDER , J. , AND XU , X. 1995 a. A density-based algorithm for discovering clusters in large spatial databases with noise . In Proceedings of the First International Conference on Knowledge Discovery and Data Mining ( Montreal, Canada, Aug. 20-21).]] ESTER, M., KRIEGEL, H.-S., SANDER, J., AND XU,X. 1995a. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the First International Conference on Knowledge Discovery and Data Mining (Montreal, Canada, Aug. 20-21).]]

10.5555/3001335.3001351

ESTER , M. , KRIEGEL , H.-S. , AND XU , X. 1995 c. Focusing techniques for efficient class identification . In Proceedings of the Fourth International Symposium on Large Spatial Databases.]] ESTER, M., KRIEGEL, H.-S., AND XU, X. 1995c. Focusing techniques for efficient class identification. In Proceedings of the Fourth International Symposium on Large Spatial Databases.]]

FALOUTSOS , C. 1996. Searching Multimedia Databases by Content . Kluwer Academic Publishers , Hingham, MA .]] FALOUTSOS, C. 1996. Searching Multimedia Databases by Content. Kluwer Academic Publishers, Hingham, MA.]]

10.1145/223784.223812

FALOUTSOS , C. AND OARD , D. W. 1995. A survey of information retrieval and filtering methods . Univ. of Maryland Institute for Advanced Computer Studies Report. University of Maryland at College Park , College Park, MD .]] FALOUTSOS,C.AND OARD, D. W. 1995. A survey of information retrieval and filtering methods. Univ. of Maryland Institute for Advanced Computer Studies Report. University of Maryland at College Park, College Park, MD.]]

FELDMAN S. 1998. Web search services in 1998: Trends and challenges. Inf. Today 9.]] FELDMAN S. 1998. Web search services in 1998: Trends and challenges. Inf. Today 9.]]

FERGUSON A. 1999. Auction nation. Time Mag.]] FERGUSON A. 1999. Auction nation. Time Mag.]]

FININ , T. , NICHOLAS , C. , AND MAYFIELD , J. 1998 . Software agents for information retrieval (short course notes) . In Proceedings of the Third ACM Conference on Digital Libraries (DL '98 , Pittsburgh, PA, June 23-26), I. Witten, R. Akscyn, and F. M. Shipman, Eds. ACM Press, New York, NY.]] FININ, T., NICHOLAS, C., AND MAYFIELD, J. 1998. Software agents for information retrieval (short course notes). In Proceedings of the Third ACM Conference on Digital Libraries (DL '98, Pittsburgh, PA, June 23-26), I. Witten, R. Akscyn, and F. M. Shipman, Eds. ACM Press, New York, NY.]]

10.1145/218380.218454

FLICKNER , M. , SAWHNEY , H. , NIBLACK , W. , ASHLEY , J. , HUANG , Q. , DOM , B. , GORKANI , M. , HAFNER , J. , LEE , D. , PETKOVIC , D. , STEELE , D. , AND YANKER , P. 1997. Query by image and video content: the QBIC system . In Intelligent Multimedia Information Retrieval, M. T. Maybury, Ed. MIT Press, Cambridge , MA , 7-22.]] FLICKNER, M., SAWHNEY, H., NIBLACK, W., ASHLEY, J., HUANG, Q., DOM, B., GORKANI, M., HAFNER, J., LEE, D., PETKOVIC, D., STEELE, D., AND YANKER, P. 1997. Query by image and video content: the QBIC system. In Intelligent Multimedia Information Retrieval, M. T. Maybury, Ed. MIT Press, Cambridge, MA, 7-22.]]

FLYNN L. 1996. Desperately seeking surfers: Web programmers try to alter search engines' results. New York Times.]] FLYNN L. 1996. Desperately seeking surfers: Web programmers try to alter search engines' results. New York Times.]]

FRAKES , W.B. AND BAEZA-YATES , R. , EDS. 1992 . Information Retrieval: Data Structures and Algorithms . Prentice-Hall, Inc. , Upper Saddle River, NJ.]] FRAKES,W.B.AND BAEZA-YATES, R., EDS. 1992. Information Retrieval: Data Structures and Algorithms. Prentice-Hall, Inc., Upper Saddle River, NJ.]]

10.1145/48511.48518

10.1006/jvlc.1998.0087

GOLUB , G.H. AND VAN LOAN , C. F. 1996. Matrix Computations . 3rd. Johns Hopkins studies in the mathematical sciences . Johns Hopkins University Press, Baltimore , MD .]] GOLUB,G.H.AND VAN LOAN, C. F. 1996. Matrix Computations. 3rd. Johns Hopkins studies in the mathematical sciences. Johns Hopkins University Press, Baltimore, MD.]]

10.1109/4236.623969

GUGLIELMO C. 1997. Upside today (on-line). home page: inc.com/cgi-bin/tech link.cgi?url=http:// www.upside.com.]] GUGLIELMO C. 1997. Upside today (on-line). home page: inc.com/cgi-bin/tech link.cgi?url=http:// www.upside.com.]]

10.1145/276304.276312

HAWKING D. CRASWELL N. THISTLEWAITE P. AND HARMAN D. 1999. Results and Challenges in Web Search Evaluation.]] HAWKING D. CRASWELL N. THISTLEWAITE P. AND HARMAN D. 1999. Results and Challenges in Web Search Evaluation.]]

10.1145/223904.223912

HEARST , M. 1997 . Interfaces for searching the web. Sci . Am. , 68 - 72 .]] HEARST, M. 1997. Interfaces for searching the web. Sci. Am., 68-72.]]

HEARST , M. 1999. User interfaces and visualization . In Modern Information Retrieval, R. Baeza-Yates and B. Ribeiro-Neto . Addison-Wesley , Reading, MA , 2257-3232.]] HEARST, M. 1999. User interfaces and visualization. In Modern Information Retrieval, R. Baeza-Yates and B. Ribeiro-Neto. Addison-Wesley, Reading, MA, 2257-3232.]]

10.1145/257089.257392

HENZINGER M. HEYDON A. MITZENMACHER M. AND NAJORK M. 1999. Measuring index quality using random walks on the web.]] HENZINGER M. HEYDON A. MITZENMACHER M. AND NAJORK M. 1999. Measuring index quality using random walks on the web.]]

HOWE , A. AND DREILINGER , D. 1997 . Savvysearch: A metasearch engine that learns which search engine to query . AI Mag. 18 , 2 , 19 - 25 .]] HOWE,A.AND DREILINGER, D. 1997. Savvysearch: A metasearch engine that learns which search engine to query. AI Mag. 18, 2, 19-25.]]

10.1126/science.277.5325.535

10.1126/science.280.5360.95

HYLTON J. 1996. Identifying and merging related bibliographic records. Master's Thesis.]] HYLTON J. 1996. Identifying and merging related bibliographic records. Master's Thesis.]]

IEEE . 1999. Special issue on intelligent information retrieval . IEEE Expert .]] IEEE. 1999. Special issue on intelligent information retrieval. IEEE Expert.]]

IEEE . 1998a. News and trends section . IEEE Internet Comput .]] IEEE. 1998a. News and trends section. IEEE Internet Comput.]]

IEEE . 1998b. Special issue on knowledge management . IEEE Expert .]] IEEE. 1998b. Special issue on knowledge management. IEEE Expert.]]

IEEE . 1996a. Special issue on intelligent agents . IEEE Expert/Intelligent Systems and Their Applications .]] IEEE. 1996a. Special issue on intelligent agents. IEEE Expert/Intelligent Systems and Their Applications.]]

10.1109/TPAMI.1996.531797

IFIP . 1989. Visual Data Base Systems I and II . Elsevier North-Holland, Inc. , Amsterdam, The Netherlands.]] IFIP. 1989. Visual Data Base Systems I and II. Elsevier North-Holland, Inc., Amsterdam, The Netherlands.]]

IWAI , Y. , MARUO , J. , YACHIDA , M. , ECHIGO , T. , AND IISAKU , S. 2000 . A framework for visual event extraction from soccer games . In Proceedings of the Fourth Asian Conference on Computer Vision (ACCV2000 , Jan. 8-11 ). 222-227.]] IWAI, Y., MARUO, J., YACHIDA, M., ECHIGO, T., AND IISAKU, S. 2000. A framework for visual event extraction from soccer games. In Proceedings of the Fourth Asian Conference on Computer Vision (ACCV2000, Jan. 8-11). 222-227.]]

JACOBY , J. AND SLAMECKA , V. 1962. Indexer consistency under minimal conditions. RADC TR 62--426. Documentation , Inc., Bethesda, MD, US. ]] JACOBY,J.AND SLAMECKA, V. 1962. Indexer consistency under minimal conditions. RADC TR 62--426. Documentation, Inc., Bethesda, MD, US.]]

KAGEYAMA , T. AND TAKASHIMA , Y. 1994 . A melody retrieval method with hummed melody. IEICE Trans. Inf. Syst. J77 , 8 (Aug.), 1543-1551.]] KAGEYAMA,T.AND TAKASHIMA, Y. 1994. A melody retrieval method with hummed melody. IEICE Trans. Inf. Syst. J77, 8 (Aug.), 1543-1551.]]

KAHLE B. 1999. Archiving the Internet. home page: www.alexa.com/ brewster/essays/sciam article.html]] KAHLE B. 1999. Archiving the Internet. home page: www.alexa.com/ brewster/essays/sciam article.html]]

KEPHART J. HANSON J. LEVINE D. GROSOF B. SAIRAMESH J. AND WHITE R. S. 1998a. Emergent behavior in information economies.]] KEPHART J. HANSON J. LEVINE D. GROSOF B. SAIRAMESH J. AND WHITE R. S. 1998a. Emergent behavior in information economies.]]

10.5555/286139.286147

KLEINBERG , J. M. 1998 . Authoritative sources in a hyperlinked environment . In Proceedings of the 1998 ACM-SIAM Symposium on Discrete Algorithms (San Francisco CA, Jan.). ACM Press , New York, NY.]] KLEINBERG, J. M. 1998. Authoritative sources in a hyperlinked environment. In Proceedings of the 1998 ACM-SIAM Symposium on Discrete Algorithms (San Francisco CA, Jan.). ACM Press, New York, NY.]]

KOBAYASHI , M. , DUPRET , G. , KING , O. , SAMUKAWA , H. , AND TAKEDA , K. 1999 . Multi-perspective retrieval, ranking and visualization of web data . In Proceedings of the International Symposium on Digital Libraries ((ISDL99) , Tsukuba, Japan). 159-162.]] KOBAYASHI, M., DUPRET, G., KING, O., SAMUKAWA, H., AND TAKEDA, K. 1999. Multi-perspective retrieval, ranking and visualization of web data. In Proceedings of the International Symposium on Digital Libraries ((ISDL99), Tsukuba, Japan). 159-162.]]

KORFHAGE , R. R. 1997. Information Storage and Retrieval . John Wiley and Sons, Inc. , New York, NY .]] KORFHAGE, R. R. 1997. Information Storage and Retrieval. John Wiley and Sons, Inc., New York, NY.]]

KOSTER , M. 1995 . Robots in the web: trick or treat ? ConneXions 9 , 4 (Apr.).]] KOSTER, M. 1995. Robots in the web: trick or treat? ConneXions 9, 4 (Apr.).]]

KOSTER M. 1996. Examination of the standard for robots exclusion. home page: info.webcrawler- .com/mak/projects/robots/eval.html]] KOSTER M. 1996. Examination of the standard for robots exclusion. home page: info.webcrawler- .com/mak/projects/robots/eval.html]]

10.1109/ICIP.1999.822860

LAGOZE C. 1996. The Warwick framework: A container architecture for diverse sets of metadata. D-Lib Mag. www.dlib.org]] LAGOZE C. 1996. The Warwick framework: A container architecture for diverse sets of metadata. D-Lib Mag. www.dlib.org]]

10.1145/223904.223956

10.1109/4236.707689

10.1126/science.280.5360.98

10.1038/21987

10.1109/35.739314

LAWRENCE , S. AND GILES , C. 1999 c. Text and image metasearch on the web . In Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA99) . 829-835.]] LAWRENCE,S.AND GILES, C. 1999c. Text and image metasearch on the web. In Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA99). 829-835.]]

LEIGHTON H.AND SRIVASTAVA J. 1997. Precision among world wide web search engines: Altavista excite hotbot infoseek and lycos. home page: www.winona.msus.edu/library/webind2/ webind2.htm.]] LEIGHTON H.AND SRIVASTAVA J. 1997. Precision among world wide web search engines: Altavista excite hotbot infoseek and lycos. home page: www.winona.msus.edu/library/webind2/ webind2.htm.]]

10.1016/S0020-0255(97)00044-3

LIBERATORE K. 1997. Getting to the source: Is it real or spam ma'am ? MacWorld.]] LIBERATORE K. 1997. Getting to the source: Is it real or spam ma'am ? MacWorld.]]

LIDSKY , D. AND KWON , R. 1997 . Searching the net . PC Mag. , 227 - 258 .]] LIDSKY,D.AND KWON, R. 1997. Searching the net. PC Mag., 227-258.]]

10.1016/S0169-7552(98)00039-7

LOSEE , R. M. 1998. Text Retrieval and Filtering: Analytic Models of Performance . Kluwer international series on information retrieval. Kluwer Academic Publishers , Hingham, MA.]] LOSEE, R. M. 1998. Text Retrieval and Filtering: Analytic Models of Performance. Kluwer international series on information retrieval. Kluwer Academic Publishers, Hingham, MA.]]

10.1038/scientificamerican0397-52

10.1016/S0169-7552(97)00050-0

MACSKASSY , S. , BANERJEE , A. , DAVISON , B. , AND HIRSH , H. 1998 . Human performance on clustering web pages: A preliminary study . In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining ( Seattle, WA, June '98). 264-268.]] MACSKASSY, S., BANERJEE, A., DAVISON, B., AND HIRSH, H. 1998. Human performance on clustering web pages: A preliminary study. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (Seattle, WA, June '98). 264-268.]]

MANBER , U. 1999. Foreword . In Modern Information Retrieval, R. Baeza-Yates and B. Ribeiro-Neto . Addison-Wesley , Reading, MA , 5-8.]] MANBER, U. 1999. Foreword. In Modern Information Retrieval, R. Baeza-Yates and B. Ribeiro-Neto. Addison-Wesley, Reading, MA, 5-8.]]

MANBER , U. , SMITH , M. , AND GOPAL , B. 1997 . Webglimpse: Combining borwsing and searching . In Proceedings on USENIX 1997 Annual Technical Conference (Jan.). 195-206 .]] MANBER, U., SMITH, M., AND GOPAL, B. 1997. Webglimpse: Combining borwsing and searching. In Proceedings on USENIX 1997 Annual Technical Conference (Jan.). 195-206.]]

10.1109/34.531803

MANNING , C. AND SCHUTZE , H. 1999. Foundations of Statistical Natural Language Processing . MIT Press, Cambridge , MA .]] MANNING,C.AND SCHUTZE, H. 1999. Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, MA.]]

MARCHIONINI , G. 1995. Information Seeking in Electronic Environments . Cambridge Series on Human-Computer Interaction . Cambridge University Press , New York, NY .]] MARCHIONINI, G. 1995. Information Seeking in Electronic Environments. Cambridge Series on Human-Computer Interaction. Cambridge University Press, New York, NY.]]

MAYBURY , M. 1997. Intelligent Multmedia Information Retrieval . MIT Press, Cambridge , MA .]] MAYBURY, M. 1997. Intelligent Multmedia Information Retrieval. MIT Press, Cambridge, MA.]]

MAYBURY , M.T. AND WAHLSTER , W. , EDS. 1998. Readings in Intelligent User Interfaces . Morgan Kaufmann Publishers Inc ., San Francisco, CA.]] MAYBURY,M.T.AND WAHLSTER, W., EDS. 1998. Readings in Intelligent User Interfaces. Morgan Kaufmann Publishers Inc., San Francisco, CA.]]

MCKNIGHT , L. 2000 . Pricing internet services: Approaches and challenges . IEEE Computer , 128 - 129 .]] MCKNIGHT, L. 2000. Pricing internet services: Approaches and challenges. IEEE Computer, 128-129.]]

10.1145/238386.238406

MITCHELL S. 1998. General internet resource finding tools. home pages: library.ucr.edu/pubs/ navigato.html]] MITCHELL S. 1998. General internet resource finding tools. home pages: library.ucr.edu/pubs/ navigato.html]]

MIYAHARA T WATANABE H. TAZOE E. KAMIYAMA Y. AND TAKEDA K. 2000. Internet Machine Translation. Mainichi Communications Japan.]] MIYAHARA T WATANABE H. TAZOE E. KAMIYAMA Y. AND TAKEDA K. 2000. Internet Machine Translation. Mainichi Communications Japan.]]

10.1145/336296.336351

MONIER L. 1998. Altavista cto responds. www4.zdnet.com/anchordesk/talkback/talkback 13066.html.]] MONIER L. 1998. Altavista cto responds. www4.zdnet.com/anchordesk/talkback/talkback 13066.html.]]

MOROHASHI , M. , TAKEDA , K. , NOMIYAMA , H. , AND MARUYAMA , H. 1995 . Information outlining . In Proceedings of International Symposium on Digital Libraries ( Tsukuba, Japan).]] MOROHASHI, M., TAKEDA, K., NOMIYAMA, H., AND MARUYAMA, H. 1995. Information outlining. In Proceedings of International Symposium on Digital Libraries (Tsukuba, Japan).]]

10.1145/217306.217311

10.3115/980691.980720

NAGAO K. HOSOYA S. KAWAKITA Y. ARIGA S. SHIRAI Y. AND YURA J. 1999. Semantic transcoding: Making the world wide web more understandable and reusable by external annotations.]] NAGAO K. HOSOYA S. KAWAKITA Y. ARIGA S. SHIRAI Y. AND YURA J. 1999. Semantic transcoding: Making the world wide web more understandable and reusable by external annotations.]]

NG , R. AND HAN , J. 1994 . Efficient and effective methods for spatial data mining . In Proceedings of the 20th International Conference on Very Large Data Bases (VLDB'94 , Santiago, Chile, Sept.). VLDB Endowment, Berkeley, CA.]] NG,R.AND HAN, J. 1994. Efficient and effective methods for spatial data mining. In Proceedings of the 20th International Conference on Very Large Data Bases (VLDB'94, Santiago, Chile, Sept.). VLDB Endowment, Berkeley, CA.]]

10.1117/12.143648

NIELSEN , J. 1993. Usability Engineering . Academic Press Prof ., Inc., San Diego, CA.]] NIELSEN, J. 1993. Usability Engineering. Academic Press Prof., Inc., San Diego, CA.]]

10.1145/291469.291470

NOMIYAMA , H. , KUSHIDA , T. , URAMOTO , N. , IOKA , M. , KUSABA , M. , KUSABA , J.-K. , CHIGONO , A. , ITOH , T. , AND TSUJI , M. 1997. Information navigation system for multimedia data. Res. Rep. RT-0227. Research Laboratory , IBM Tokyo , Tokyo, Japan .]] NOMIYAMA, H., KUSHIDA, T., URAMOTO, N., IOKA, M., KUSABA, M., KUSABA, J.-K., CHIGONO, A., ITOH, T., AND TSUJI, M. 1997. Information navigation system for multimedia data. Res. Rep. RT-0227. Research Laboratory, IBM Tokyo, Tokyo, Japan.]]

OARD , D. 1997 a. Cross-language text retrieval research in the USA . In Proceedings of the Third Delos Workshop on ERCIM (Mar.).]] OARD, D. 1997a. Cross-language text retrieval research in the USA. In Proceedings of the Third Delos Workshop on ERCIM (Mar.).]]

OARD , D. 1997 b. Serving users in many languages . D-Lib Mag. 3 , 1 (Jan.).]] OARD, D. 1997b. Serving users in many languages. D-Lib Mag. 3, 1 (Jan.).]]

10.1145/99935.99947

10.1145/274497.274521

PARLETT , B. N. 1998. The Symmetric Eigenvalue Problem . Prientice-Hall SIAM Classics in Applied Mathematics Series . Prentice-Hall, Inc. , Upper Saddle River, NJ.]] PARLETT, B. N. 1998. The Symmetric Eigenvalue Problem. Prientice-Hall SIAM Classics in Applied Mathematics Series. Prentice-Hall, Inc., Upper Saddle River, NJ.]]

10.1145/290941.290957

10.1145/238386.238450

PRESCHEL , B. 1972. Indexer consistency in perception of concepts and choice of terminology. Final Rep. Columbia Univ ., New York, NY .]] PRESCHEL, B. 1972. Indexer consistency in perception of concepts and choice of terminology. Final Rep. Columbia Univ., New York, NY.]]

RAGHAVAN , P. 1997 . Information retrieval algorithms: A survey . In Proceedings of the Symposium on Discrete Algorithms. ACM Press , New York, NY.]] RAGHAVAN, P. 1997. Information retrieval algorithms: A survey. In Proceedings of the Symposium on Discrete Algorithms. ACM Press, New York, NY.]]

10.1145/238386.238405

10.1145/205323.205326

RASMUSSEN , E. 1992 . Clustering algorithms. In Information Retrieval: Data Structures and Algorithms, W. B. Frakes and R. Baeza-Yates, Eds. Prentice-Hall, Inc., Upper Saddle River , NJ , 419 - 442 .]] RASMUSSEN, E. 1992. Clustering algorithms. In Information Retrieval: Data Structures and Algorithms, W. B. Frakes and R. Baeza-Yates, Eds. Prentice-Hall, Inc., Upper Saddle River, NJ, 419-442.]]

10.1109/34.531800

10.1145/192426.192429

10.1145/175235.175242

10.1145/108844.108883

10.1109/ICSMC.1999.812400

10.1002/asi.4630200109

10.1126/science.168.3929.335

SALTON , G. , ED. 1971. The Smart Retrieval System: Experiments in Automatic Document Processing . Prentice-Hall , Englewood Cliffs, NJ .]] SALTON, G., ED. 1971. The Smart Retrieval System: Experiments in Automatic Document Processing. Prentice-Hall, Englewood Cliffs, NJ.]]

SALTON , G. , ED. 1988. Automatic Text Processing . Addison-Wesley Series in Computer Science. Addison-Wesley Longman Publ . Co., Inc., Reading, MA.]] SALTON, G., ED. 1988. Automatic Text Processing. Addison-Wesley Series in Computer Science. Addison-Wesley Longman Publ. Co., Inc., Reading, MA.]]

SALTON , G. AND MCGILL , M. J. 1983. Introduction to Modern Information Retrieval . McGraw- Hill, Inc. , Hightstown, NJ .]] SALTON,G.AND MCGILL, M. J. 1983. Introduction to Modern Information Retrieval. McGraw- Hill, Inc., Hightstown, NJ.]]

10.1109/34.531799

10.1126/science.275.5298.327

SCHAUBLE , P. 1997. Multimedia Information Retrieval: Content-Based Information Retrieval from Large Text and Audio Databases . Kluwer Academic Publishers, Hingham , MA .]] SCHAUBLE, P. 1997. Multimedia Information Retrieval: Content-Based Information Retrieval from Large Text and Audio Databases. Kluwer Academic Publishers, Hingham, MA.]]

SCIENTIFIC AMERICAN . 1997 . The Internet: Fulfillling the promise: special report. Scientific American , Inc., New York, NY. ]] SCIENTIFIC AMERICAN. 1997. The Internet: Fulfillling the promise: special report. Scientific American, Inc., New York, NY.]]

SELBERG , E. AND ETZIONI , O. 1995a. The metacrawler architecture for resource aggregation on the web . IEEE Expert .]] SELBERG,E.AND ETZIONI, O. 1995a. The metacrawler architecture for resource aggregation on the web. IEEE Expert.]]

SELBERG , E. AND ETZIONI , O. 1995 b. Multiple service search and comparison using the metacrawler . In Proceedings of the Fourth International Conference on The World Wide Web ( Boston, MA).]] SELBERG,E.AND ETZIONI, O. 1995b. Multiple service search and comparison using the metacrawler. In Proceedings of the Fourth International Conference on The World Wide Web (Boston, MA).]]

10.1016/S0169-7552(97)00048-2

SHIVAKUMAR , N. AND GARCIA-MOLINA , H. 1998 . Finding near-replicas of documents on the web . In Proceedings of the Workshop on Web Databases ( Valencia, Spain, Mar.).]] SHIVAKUMAR,N.AND GARCIA-MOLINA, H. 1998. Finding near-replicas of documents on the web. In Proceedings of the Workshop on Web Databases (Valencia, Spain, Mar.).]]

10.1109/34.531805

SILBERSCHATZ , A. , STONEBRAKER , M. , AND ULLMAN , J. 1995 . Database research: Achievements and opportunities into the 21st century . In Proceedings of the NSF Workshop on The Future of Database Research (May).]] SILBERSCHATZ, A., STONEBRAKER, M., AND ULLMAN, J. 1995. Database research: Achievements and opportunities into the 21st century. In Proceedings of the NSF Workshop on The Future of Database Research (May).]]

10.1002/asi.4630240406

10.1145/244130.244151

SMITH , J.R. AND CHANG , S.-F. 1997a. Querying by color regions using VisualSEEk content-based visual query system . In Intelligent Multimedia Information Retrieval, M. T. Maybury, Ed. MIT Press, Cambridge , MA , 23-41.]] SMITH,J.R.AND CHANG, S.-F. 1997a. Querying by color regions using VisualSEEk content-based visual query system. In Intelligent Multimedia Information Retrieval, M. T. Maybury, Ed. MIT Press, Cambridge, MA, 23-41.]]

SMITH , J. AND CHANG , S.-F. 1997b. Searching for images and videos on the world-wide web . IEEE MultiMedia .]] SMITH,J.AND CHANG, S.-F. 1997b. Searching for images and videos on the world-wide web. IEEE MultiMedia.]]

SMITH Z. 1973. The truth about the web: Crawling towards eternity. Web Tech. Mag. www.webtechniques. com/features/1997/05/burner/burner. html]] SMITH Z. 1973. The truth about the web: Crawling towards eternity. Web Tech. Mag. www.webtechniques. com/features/1997/05/burner/burner. html]]

10.1109/93.311653

SNEATH , P.H. A. AND SOKAL , R. R. 1973. Numerical Taxonomy . Freeman , London, UK .]] SNEATH,P.H.A.AND SOKAL, R. R. 1973. Numerical Taxonomy. Freeman, London, UK.]]

10.5555/4543

SOFFER A.AND SAMET H. 2000. Pictorial query specification for browsing through spatiallyreferenced image databases. J. Visual Lang. Comput.]] SOFFER A.AND SAMET H. 2000. Pictorial query specification for browsing through spatiallyreferenced image databases. J. Visual Lang. Comput.]]

SPARCK JONES , K. AND WILLETT , P. , EDS. 1997. Readings In Information Retrieval . Morgan Kaufmann multimedia information and systems series . Morgan Kaufmann Publishers Inc ., San Francisco, CA.]] SPARCK JONES,K.AND WILLETT, P., EDS. 1997. Readings In Information Retrieval. Morgan Kaufmann multimedia information and systems series. Morgan Kaufmann Publishers Inc., San Francisco, CA.]]

10.1145/223784.223807

STRATEGYALLEY. 1998. White paper on the viability of the internet for business. home page: www.strategyalley.com/articles/inet1.htm.]] STRATEGYALLEY. 1998. White paper on the viability of the internet for business. home page: www.strategyalley.com/articles/inet1.htm.]]

STRZALKOWSKI , T. 1999. Natural Language Information Retreival . Kluwer Academic Publishers , Hingham, MA .]] STRZALKOWSKI, T. 1999. Natural Language Information Retreival. Kluwer Academic Publishers, Hingham, MA.]]

TAKEDA , K. AND NOMIYAMA , H. 1997 . Information outlining and site outlining . In Proceedings of the International Symposium on Digital Libraries (ISDL97 , Tsukuba, Japan).]] TAKEDA,K.AND NOMIYAMA, H. 1997. Information outlining and site outlining. In Proceedings of the International Symposium on Digital Libraries (ISDL97, Tsukuba, Japan).]]

TETRANET SOFTWARE INC. 1998. Wisebot. Home page for Wisebots: www.tetranetsoftware.com/ products/wisebot.htm]] TETRANET SOFTWARE INC. 1998. Wisebot. Home page for Wisebots: www.tetranetsoftware.com/ products/wisebot.htm]]

TUFTE , E. R. 1986. The Visual Display of Quantitative Information . Graphics Press, Cheshire , CT .]] TUFTE, E. R. 1986. The Visual Display of Quantitative Information. Graphics Press, Cheshire, CT.]]

10.1108/eb026637

VAN RIJSBERGEN , C. 1979. Information Retrieval . 2 nd ed. Butterworths, London , UK .]] VAN RIJSBERGEN, C. 1979. Information Retrieval. 2nd ed. Butterworths, London, UK.]]

WALKER J. CASE T. JORASCH J. AND SPARICO T. 1996. Method apparatus and program for pricing selling and exercising options to purchase airline tickets: U.S. Patent no. 5797127.]] WALKER J. CASE T. JORASCH J. AND SPARICO T. 1996. Method apparatus and program for pricing selling and exercising options to purchase airline tickets: U.S. Patent no. 5797127.]]

WALKER J. SPARICO T. AND CASE T. 1997. Method and apparatus for the sale of airline-specified flight tickets: U.S. Patent no. 5897620.]] WALKER J. SPARICO T. AND CASE T. 1997. Method and apparatus for the sale of airline-specified flight tickets: U.S. Patent no. 5897620.]]

10.3115/993268.993389

WEBSTER K.AND PAUL K. 1996. Beyond surfing: Tools and techniques for searching the web. home page: magi.com/mmelick/it96jan.htm.]] WEBSTER K.AND PAUL K. 1996. Beyond surfing: Tools and techniques for searching the web. home page: magi.com/mmelick/it96jan.htm.]]

WESTERA G. 1996. Robot-driven search engine evaluation overview. www.curtin.edu.au/curtin/ library/staffpages/gwpersonal/senginestudy/.]] WESTERA G. 1996. Robot-driven search engine evaluation overview. www.curtin.edu.au/curtin/ library/staffpages/gwpersonal/senginestudy/.]]

WHITE H.AND MCCAIN K. 1989. Bibliometrics. Annual Review Information Science and Technology.]] WHITE H.AND MCCAIN K. 1989. Bibliometrics. Annual Review Information Science and Technology.]]

10.1016/0306-4573(88)90027-1

10.1016/S0020-7373(84)80052-8

10.1145/133160.133216

10.5555/857186.857579

WITTEN , I.H. , MOFFAT , A. , AND BELL , T. C. 1994 . Managing Gigabytes: Compressing and Indexing Documents and Images . Van Nostrand Reinhold Co., New York , NY .]] WITTEN,I.H.,MOFFAT, A., AND BELL, T. C. 1994. Managing Gigabytes: Compressing and Indexing Documents and Images. Van Nostrand Reinhold Co., New York, NY.]]

10.1109/93.311656

10.1145/290941.290956

ZAMIR , O. , ETZIONI , O. , MADANI , O. , AND KARP , R. 1997 . Fast and intuitive clustering of web documents . In Proceedings of the ACM SIG- MOD International Workshop on Data Mining and Knowledge Discovery (SIGMOD-96 , Aug.), R. Ng, Ed. ACM Press, New York, NY, 287- 290.]] ZAMIR, O., ETZIONI, O., MADANI, O., AND KARP,R. 1997. Fast and intuitive clustering of web documents. In Proceedings of the ACM SIG- MOD International Workshop on Data Mining and Knowledge Discovery (SIGMOD-96, Aug.), R. Ng, Ed. ACM Press, New York, NY, 287- 290.]]

10.1145/233269.233324

10.1109/ISCAS.1997.622202