Generalized vector spaces model in information retrieval

Authors: S. K. M. Wong, Wojciech Ziarko, Patrick C. N. Wong

DOI: 10.1145/253495.253506

Keywords: Representation (mathematics), Theoretical computer science, Divergence-from-randomness model, Vector space, Term discrimination, Automatic indexing, Function space, Vector space model, Information retrieval, Generalized vector space model, Computer science

Abstract: In information retrieval, it is common to model index terms and documents as vectors in a suitably defined vector space. The main difficulty with this approach is that the explicit representation of term vectors is not known a priori. For this reason, the vector space model adopted by Salton for the SMART system treats the terms as a set of orthogonal vectors. In such a model it is often necessary to adopt a separate, corrective procedure to take into account the correlations between terms. In this paper, we propose a systematic method (the generalized vector space model) to compute term correlations directly from the automatic indexing scheme. We also demonstrate how such correlations can be included with minimal modification in existing vector-based information retrieval systems. The preliminary experimental results obtained from the new model are very encouraging.
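The contrast the abstract draws between orthogonal term vectors and correlated term vectors can be illustrated with a minimal sketch. It assumes a term correlation matrix `corr` is already available, with `corr[i][j]` standing in for the inner product of term vectors i and j. The function name `gvsm_similarity` and the correlation values are hypothetical; the paper's own procedure for deriving correlations from automatic indexing is not described in this abstract and is not reproduced here.

```python
import math

def gvsm_similarity(doc, query, corr):
    """Cosine-style similarity under a term correlation matrix.

    doc, query: lists of term weights of equal length n.
    corr: n x n symmetric, positive semi-definite matrix (an assumption);
          corr[i][j] stands in for the inner product of term vectors i and j.
    """
    def bilinear(x, y):
        # x^T * corr * y, written out explicitly
        return sum(x[i] * corr[i][j] * y[j]
                   for i in range(len(x)) for j in range(len(y)))

    denom = math.sqrt(bilinear(doc, doc) * bilinear(query, query))
    return bilinear(doc, query) / denom if denom > 0 else 0.0

# Orthogonal terms: the identity matrix recovers the ordinary cosine measure.
identity = [[1.0, 0.0], [0.0, 1.0]]
# Illustrative (made-up) correlation of 0.5 between the two terms.
correlated = [[1.0, 0.5], [0.5, 1.0]]

doc = [1.0, 0.0]    # document contains only term 0
query = [0.0, 1.0]  # query contains only term 1

print(gvsm_similarity(doc, query, identity))    # 0.0
print(gvsm_similarity(doc, query, correlated))  # 0.5
```

With orthogonal terms, a document and a query that share no term score zero; once a nonzero correlation between the two terms is supplied, the same pair receives a positive score without any change to the document or query weights.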

References (9)
S. K. M. Wong, Vijay V. Raghavan. Vector space model of information retrieval: a reevaluation. International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 167-185 (1984). DOI: 10.5555/636805.636816
G. Salton, C. T. Yu, C. Buckley. An evaluation of term dependence models in information retrieval. International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 151-173 (1982). DOI: 10.5555/636713.636726
Gerard Salton, Michael J. McGill. Introduction to Modern Information Retrieval (1983).
D. J. Harper, C. J. van Rijsbergen. An evaluation of feedback in document retrieval using co-occurrence data. Journal of Documentation, vol. 34, pp. 189-216 (1978). DOI: 10.1108/EB026659
Matthew B. Koll. WEIRD. ACM SIGIR Forum, vol. 13, pp. 32-50 (1979). DOI: 10.1145/1095366.1095368
Vijay V. Raghavan, C. T. Yu. Experiments on the determination of the relationships between terms. ACM Transactions on Database Systems, vol. 4, pp. 240-260 (1979). DOI: 10.1145/320071.320081
C. J. van Rijsbergen. A theoretical basis for the use of co-occurrence data in information retrieval. Journal of Documentation, vol. 33, pp. 106-119 (1977). DOI: 10.1108/EB026637