作者: Cyril Goutte , Eric Gaussier
DOI:
关键词:
摘要: A probabilistic document categorizer has an associated vocabulary of words and plurality parameters derived from a collection documents. new is received. The are updated to reflect addition the documents based on contained in document, category size parameter indicative effective total number instances