作者: Andreas Hotho , Steffen Staab , Gerd Stumme
DOI:
关键词:
摘要: Text document clustering plays an important role in providing intuitive navigation and browsing mechanisms by organizing large amounts of information into a small number meaningful clusters. The bag words representation used for these methods is often unsatisfactory as it ignores relationships between terms that do not co-occur literally. In order to deal with the problem, we integrate background knowledge — our application Wordnet process text documents. We cluster documents standard partitional algorithm. Our experimental evaluation on Reuters newsfeeds compares results pre-categorizations news. experiments, improvements compared baseline can be shown many interesting tasks.