作者: Susan Dumais , John Platt , David Heckerman , Mehran Sahami
关键词:
摘要: 1. ABSTRACT Text categorization – the assignment of natural language texts to one or more predefined categories based on their content is an important component in many information organization and management tasks. We compare effectiveness five different automatic learning algorithms for text terms speed, realtime classification accuracy. also examine training set size, alternative document representations. Very accurate classifiers can be learned automatically from examples. Linear Support Vector Machines (SVMs) are particularly promising because they very accurate, quick train, evaluate. 1.1