NEW DIRECTIONS IN TEXT CATEGORIZATION

作者： Richard S. Forsyth

关键词:

摘要: As more and documents are held in machine-readable form, problems of efficient text processing analysis become pressing. An important kind processing, which has recently attracted the attention researchers Artificial Intelligence (AI), is categorization, e.g. automatically assigning news stories [11.5] or medical case notes [11.46] a suitable category code. However, classifying not new problem: workers field stylometry have been grappling with it for than century. Typically, stylometers given most to authorship attribution used statistical methods, while AI-based research concentrated on discrimination by subject matter, using machine-learning techniques. The present chapter reports several recent studies drawing both these traditions. In addition, investigates various methods Textual Feature-Finding, i.e. choosing textual features attributes that: (1) do depend subjective judgement; (2) need knowledge sources external texts being analyzed, such as computerized lexicon; (3) presume that studied English; (4) assume word only possible unit.

springer.com 本地加速

richardsandesforsyth.net PDF 下载加速

sci-hub.st HTML 下载加速

参考文章(63)

Heikki Mannila, Erja Nikunen, Helena Ahonen, Forming grammars for structured documents AAAIWS'93 Proceedings of the 2nd International Conference on Knowledge Discovery in Databases. pp. 314- 325 ,(1993)

F. N. Teskey, Principles of text processing ,(1982)

Thomas Bolton Horton, The effectiveness of the stylometry of function words in discriminating between Shakespeare and Fletcher The University of Edinburgh. ,(1987)

Nelleke Oostdijk, Corpus Linguistics and the Automatic Analysis of English ,(1991)

Richard Forsyth, Stylistic atructures: a computational approach to text classification University of Nottingham. ,(1996)

Roy Rada, Richard Forsyth, Machine learning : applications in expert systems and information retrieval ,(1986)

Sholom M. Weiss, Computer systems that learn ,(1990)

Louis Tonko Milic, A quantitative approach to the style of Jonathan Swift ,(1967)

David L. Wallace, Frederick Mosteller, Applied Bayesian and Classical Inference: The Case of The Federalist Papers ,(2012)

10.

A. Q. Morton, Literary detection: How to prove authorship and fraud in literature and documents ,(1978)

NEW DIRECTIONS IN TEXT CATEGORIZATION

来源期刊

我的账户

NEW DIRECTIONS IN TEXT CATEGORIZATION

来源期刊

相似文章 10

我的账户