An algorithm for the calculation of exact term discrimination values

作者: Peter Willett

DOI: 10.1016/0306-4573(85)90107-4

关键词:

摘要: Abstract Term discrimination values have been suggested as an effective means for the selection and weighting of index terms in automatic document retrieval systems. This paper reports algorithm calculation term that is sufficiently fast operation to permit use exact values, rather than approximate studied previous work. Evidence presented show relationship between frequency crucially dependent upon type inter-document similarity measure used values.

参考文章(10)
Gerard Salton, Theory of Indexing Society for Industrial and Applied Mathematics. ,(1975) , 10.1137/1.9781611970500
Gerard Salton, Michael J. McGill, Introduction to Modern Information Retrieval ,(1983)
S. E. Robertson, Term frequency and term value ACM SIGIR Forum. ,vol. 16, pp. 22- 29 ,(1981) , 10.1145/1013228.511758
G. Salton, A. Wong, On the role of words and phrases in automatic text analysis Computers and the Humanities. ,vol. 10, pp. 69- 87 ,(1976) , 10.1007/BF02426255
Robert G. Crawford, The computation of discrimination values Information Processing & Management. ,vol. 11, pp. 249- 253 ,(1975) , 10.1016/0306-4573(75)90022-9
G. Salton, C. S. Yang, C. T. Yu, A Theory of Term Importance in Automatic Text Analysis Journal of the Association for Information Science and Technology. ,vol. 26, pp. 33- 44 ,(1974) , 10.1002/ASI.4630260106
M.F. Porter, An algorithm for suffix stripping Program: Electronic Library and Information Systems. ,vol. 40, pp. 313- 316 ,(1997) , 10.1108/EB046814
Shirley A. Perry, Peter Willett, A review of the use of inverted files for best match searching in information retrieval systems Journal of Information Science. ,vol. 6, pp. 59- 66 ,(1983) , 10.1177/016555158300600204
Terry Noreault, Matthew Koll, Michael J. McGill, Automatic ranked output from boolean searches in SIRE Journal of the Association for Information Science and Technology. ,vol. 28, pp. 333- 339 ,(1977) , 10.1002/ASI.4630280605
G. Salton, A. Wong, C. S. Yang, A vector space model for automatic indexing Communications of the ACM. ,vol. 18, pp. 613- 620 ,(1975) , 10.1145/361219.361220