作者: Dieter Merkl
DOI: 10.1007/3-540-63223-9_110
关键词: Computer science 、 Information retrieval 、 Similarity (psychology) 、 Structure (mathematical logic) 、 Artificial neural network 、 Cluster analysis 、 Digital library 、 Representation (mathematics) 、 Unsupervised learning 、 Self-organizing map
摘要: Classification is one of the central issues in any system dealing with text data. The need for effective approaches dramatically increased nowadays due to advent massive digital libraries containing free-form documents. What we are looking powerful methods exploration such whereby detection similarities between various documents overall goal. In other words, that may be used gain insight inherent structure items contained a archive needed. this paper demonstrate applicability self-organizing maps, neural network model adhering unsupervised learning paradigm, task document clustering. order improve representation result present an extension basic rule captures movement weight vectors two-dimensional output space convenient visual inspection. extended training algorithm allows intuitive analysis input data and most important, recognition cluster boundaries.