An Empirical Comparison of Text Categorization Methods

作者: Ana Cardoso-Cachopo , Arlindo L. Oliveira

DOI: 10.1007/978-3-540-39984-1_14

关键词:

摘要: In this paper we present a comprehensive comparison of the performance number text categorization methods in two different data sets. particular, evaluate Vector and Latent Semantic Analysis (LSA) methods, classifier based on Support Machines (SVM) k-Nearest Neighbor variations LSA models.

参考文章(23)
Gerard Salton, The SMART retrieval system ,(1971)
John Caron, Experiments with LSA scoring: optimal rank and basis Computational information retrieval. pp. 157- 169 ,(2001)
Nello Cristianini, J Shawe-Taylor, An introduction to Support Vector Machines Cambridge University Press (2000). ,(2000)
r;ribeiro-neto bueza-yates (b), Modern Information Retrieval ,(1999)
Gerard Salton, Christopher Buckley, Term Weighting Approaches in Automatic Text Retrieval Information Processing and Management. ,vol. 24, pp. 323- 328 ,(1988) , 10.1016/0306-4573(88)90021-0
G. Salton, M. E. Lesk, Computer Evaluation of Indexing and Text Processing Journal of the ACM. ,vol. 15, pp. 8- 36 ,(1968) , 10.1145/321439.321441
Brij Masand, Gordon Linoff, David Waltz, Classifying news stories using memory based reasoning international acm sigir conference on research and development in information retrieval. pp. 59- 65 ,(1992) , 10.1145/133160.133177
Shari L. Jackson, Steven J. Stratford, Joseph Krajcik, Elliot Soloway, A learner-centered tool for students building models Communications of The ACM. ,vol. 39, pp. 48- 49 ,(1996) , 10.1145/227210.227224
Yiming Yang, Xin Liu, A re-examination of text categorization methods international acm sigir conference on research and development in information retrieval. pp. 42- 49 ,(1999) , 10.1145/312624.312647