AN EVALUATION OF FEEDBACK IN DOCUMENT RETRIEVAL USING CO‐OCCURRENCE DATA

Emerald - Tập 34 Số 3 - Trang 189-216 - 1978

Tóm tắt

This paper reports experiments with a term weighting model incorporating relevance information in which it is assumed that index terms are distributed dependently. Initially this model was tested with complete relevance information against a similar model which assumes index terms are distributed independently. The experiments demonstrated conclusively that index terms are not independent for a number of diverse document collections. It was concluded that the use of relevance information together with dependence information could potentially improve retrieval effectiveness. As a result of further experiments the initial strict dependence model was modified and in particular a new relevance‐based term weight was developed. This modified dependence model was then used as the basis for relevance feedback, i.e. with partial relevance information only, and significant increases in retrieval effectiveness were achieved. The evaluation method used in the feedback experiments emphasized the effect of the feedback on documents which the potential user would not previously have seen. Finally the incorporation of relevance feedback in an operational system is considered and in particular it is argued that if high recall searches are required, relevance feedback based on the modified dependence model may be superior to the widely used Boolean search.

Tài liệu tham khảo

ROBERTSON S. E., 1976, Journal of the ASIS, 27, 129 10.1145/321921.321930 10.1108/eb026637 HOLMES P ., 1977, On-line information retrieval-An introduction and guide to the British Library's short-term experimental information network project, 1 BARRACLOUGH E. D., 1975, T h e Medusa current awareness experiment CLEVERDON C. W., 1966, Factors determining the performance of indexing systems, 2 vols ATTCHISON T. M., 1970, Institute of Electrical Engineers KEEN E. M., 1972, Report of an information science languages test, 2 vols SPARCKJONES K., 1977, Research on automatic indexing 1974-6,2 vols. Computer Laboratory DUDA R. O., 1973, Pattern classification and scene analysis. N e w York: Wiley 10.1108/eb026647 VAN RIJSBERGEN C. J., Information retrieval, 2 CHOW C. K., IEEE Transactions on Information Theory, IT-14, 1968, 462 VAN RIJSBERGEN C. J., 1975, Information retrieval ROCCHIO JR., 1971, The SMART retrieval system. N e w Jersey IDE E., 1971, The SMART retrieval system. N e w Jersey IDE, E. Relevance feedback in an automatic document retrieval system. Master's thesis, Report ISR-15 to the National Science Foundation, Department of Computer Science, Cornell University, Ithaca, N.Y., 1969.