Incremental training for probabilistic categorizer

Authors: Cyril Goutte, Eric Gaussier

DOI:

Keywords:

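Abstract: A probabilistic document categorizer has an associated vocabulary of words and an associated plurality of probabilistic categorizer parameters derived from a collection of documents. When a new document is received, the parameters are updated to reflect addition of the new document to the collection of documents, based on the vocabulary words contained in the new document, a category of the new document, and a collection size parameter indicative of an effective total number of instances of vocabulary words in the collection of documents.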
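
The update described in the abstract can be illustrated with a minimal sketch. It assumes a simple count-based (naive-Bayes-style) model rather than the authors' actual categorizer, whose parameterization is not given in this record. The class name `IncrementalCategorizer`, its methods, and the add-one smoothing are hypothetical choices made for illustration. The `collection_size` attribute only loosely plays the role of the collection size parameter: here it is a plain running total of vocabulary-word instances.

```python
from collections import Counter, defaultdict


class IncrementalCategorizer:
    """Hypothetical count-based categorizer that supports incremental updates.

    Assumption: a naive-Bayes-style model, not the method of the source record.
    """

    def __init__(self, vocabulary):
        self.vocabulary = set(vocabulary)
        # category -> Counter of vocabulary-word occurrences in that category
        self.word_counts = defaultdict(Counter)
        # category -> total vocabulary-word instances in that category
        self.category_counts = Counter()
        # total vocabulary-word instances in the whole collection
        self.collection_size = 0

    def add_document(self, words, category):
        """Fold one labeled document into the parameters without retraining."""
        counts = Counter(w for w in words if w in self.vocabulary)
        n = sum(counts.values())
        self.word_counts[category].update(counts)
        self.category_counts[category] += n
        self.collection_size += n

    def p_category(self, category):
        """Estimate P(category) from word-instance counts."""
        return self.category_counts[category] / self.collection_size

    def p_word_given_category(self, word, category):
        """Estimate P(word | category) with add-one smoothing (an assumption)."""
        numerator = self.word_counts[category][word] + 1
        denominator = self.category_counts[category] + len(self.vocabulary)
        return numerator / denominator


categorizer = IncrementalCategorizer(["ball", "team", "vote"])
categorizer.add_document(["the", "ball", "team"], "sports")  # "the" is out of vocabulary
categorizer.add_document(["vote", "vote"], "politics")
print(categorizer.collection_size)
print(categorizer.p_category("sports"))
```

Each call to `add_document` touches only the counts for the words in the new document, so the cost of an update depends on the document length and not on the size of the collection.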

References (6)
E. Gaussier, C. Goutte, K. Popat, F. Chen, A Hierarchical Model for Clustering and Categorising Documents, Lecture Notes in Computer Science, pp. 229-247 (2002), 10.1007/3-540-45886-7_16
Boriana L. Milenova, Marcos M. Campos, Probabilistic model generation (2003)
Yiming Yang, Jan O. Pedersen, A Comparative Study on Feature Selection in Text Categorization, International Conference on Machine Learning, pp. 412-420 (1997)