Term-weighting approaches in automatic text retrieval

Information Processing & Management - Volume 24 - Pages 513-523 - 1988
Gerard Salton, Christopher Buckley
Department of Computer Science, Cornell University, Ithaca, NY 14853, USA

References

Luhn, 1957, A statistical approach to the mechanized encoding and searching of literary information, IBM Journal of Research and Development, 1, 309, 10.1147/rd.14.0309
1971
Salton, 1983
van Rijsbergen, 1979
Luhn, 1955, A new method of recording and searching information, American Documentation, 4, 14, 10.1002/asi.5090040104
Taube, 1952, The logical structure of coordinate indexing, American Documentation, 3, 213, 10.1002/asi.5090030404
Perry, 1950, Information analysis for machine searching, American Documentation, 1, 133, 10.1002/asi.5090010303
van Rijsbergen, 1977, A theoretical basis for the use of cooccurrence data in information retrieval, Journal of Documentation, 33, 106, 10.1108/eb026637
Salton, 1983, An evaluation of term dependence models in information retrieval, 146, 151
Yu, 1983, A generalized term dependence model in information retrieval, Information Technology: Research and Development, 2, 129
Lesk, 1969, Word-word associations in document retrieval systems, American Documentation, 20, 27, 10.1002/asi.4630200106
Klingbiel, 1973, Machine aided indexing of technical literature, Information Storage and Retrieval, 9, 79, 10.1016/0020-0271(73)90020-X
Klingbiel, 1973, A technique for machine aided indexing, Information Storage and Retrieval, 9, 477, 10.1016/0020-0271(73)90034-X
Dillon, 1983, Fully automatic syntax-based indexing, Journal of the ASIS, 34, 99
Sparck Jones, 1984, Automatic search term variant generation, Journal of Documentation, 40, 50, 10.1108/eb026757
Fagan, 1987, Experiments in automatic phrase indexing for document retrieval: A comparison of syntactic and non-syntactic methods
Smeaton, 1986, Incorporating syntactic information into a document retrieval strategy: An investigation, 103
Sparck Jones, 1971
Salton, 1972, Experiments in automatic thesaurus construction for information retrieval, 115
Dattola, 1971, Experiments with fast algorithms for automatic classification, 265
Walker, 1987, Knowledge resource tools for analyzing large text files, 247
Kucera, 1985, Uses of on-line lexicons, 7
Amsler, 1984, Machine-readable dictionaries, Vol. 19, 161
Fox, 1980, Lexical relations: Enhancing effectiveness of information retrieval systems, ACM SIGIR Forum, 15, 5, 10.1145/1095403.1095404
Croft, 1986, User-specified domain knowledge for document retrieval, 201
Thompson, 1985, An expert system for document retrieval, 448
Croft, 1987, Approaches to intelligent information retrieval, Information Processing & Management, 23, 249, 10.1016/0306-4573(87)90016-1
Sparck Jones, 1983, Intelligent retrieval, Vol. 7, 136
Fox, 1987, Development of the coder system: A testbed for artificial intelligence methods in information retrieval, Information Processing & Management, 23, 341, 10.1016/0306-4573(87)90022-7
Salton, 1986, On the use of knowledge based processing in automatic text retrieval, 277
Swanson, 1960, Searching natural language text by computer, Science, 132, 1099, 10.1126/science.132.3434.1099
Cleverdon, 1966, Aslib-Cranfield research project, Vol. 2
Cleverdon, 1977, A computer evaluation of searching by controlled languages and natural language in an experimental NASA database
Lancaster, 1968
Blair, 1985, An evaluation of retrieval effectiveness for a full-text document retrieval system, Communications of the ACM, 28, 289, 10.1145/3166.3197
Salton, 1986, Another look at automatic text retrieval systems, Communications of the ACM, 29, 648, 10.1145/6138.6149
Salton, 1973, Recent studies in automatic text analysis and document retrieval, Journal of the ACM, 20, 258, 10.1145/321752.321757
Sparck Jones, 1972, A statistical interpretation of term specificity and its application in retrieval, Journal of Documentation, 28, 11, 10.1108/eb026526
Salton, 1973, On the specification of term values in automatic indexing, Journal of Documentation, 29, 351, 10.1108/eb026562
Salton, 1975, A theory of indexing
Salton, 1975, A theory of term importance in automatic text analysis, Journal of the ASIS, 26, 33
Bookstein, 1975, A decision theoretic foundation for indexing, Journal of the ASIS, 26, 45
Cooper, 1978, Foundation of probabilistic and utility theoretic indexing, Journal of the ACM, 25, 67, 10.1145/322047.322053
Robertson, 1976, Relevance weighting of search terms, Journal of the ASIS, 27, 129
Croft, 1975, Using probabilistic models of information retrieval without relevance information, Journal of Documentation, 35, 285, 10.1108/eb026683
Wu, 1981, A comparison of search term weighting: Term relevance versus inverse document frequency, ACM SIGIR Forum, 16, 30, 10.1145/1013228.511759
Noreault, 1977, Automatic ranked output from Boolean searches in SIRE, Journal of the ASIS, 27, 333
Radecki, 1982, Incorporation of relevance feedback into Boolean retrieval systems, 146, 133
Paice, 1983, Soft evaluation of Boolean search queries in information retrieval systems, Information Technology: Research and Development, 3, 33
Cater, 1987, A topological information retrieval system (TIRS) satisfying the requirements of the Waller-Kraft wish list, 171
Wong, 1986, Extended Boolean query processing in the generalized vector space model
Wong, 1985, Generalized vector space model in information retrieval, 18
Wu, 1981, On query formulation in information retrieval
Salton, 1983, Extended Boolean information retrieval, Communications of the ACM, 26, 1022, 10.1145/182.358466
Croft, 1984, A comparison of the cosine correlation and the modified probabilistic model, Information Technology: Research and Development, 3, 113