Exploration of Document Collections with Self-Organizing Maps: A Novel Approach to Similarity Representation

Author: Dieter Merkl

DOI: 10.1007/3-540-63223-9_110

Keywords: Computer science, Information retrieval, Similarity (psychology), Structure (mathematical logic), Artificial neural network, Cluster analysis, Digital library, Representation (mathematics), Unsupervised learning, Self-organizing map

Abstract: Classification is one of the central issues in any system dealing with text data. The need for effective approaches has dramatically increased nowadays due to the advent of massive digital libraries containing free-form documents. What we are looking for are powerful methods for the exploration of such libraries, whereby the detection of similarities between the various documents is the overall goal. In other words, methods that may be used to gain insight into the inherent structure of the items contained in a document archive are needed. In this paper we demonstrate the applicability of self-organizing maps, a neural network model adhering to the unsupervised learning paradigm, to the task of document clustering. In order to improve the representation of the result, we present an extension to the basic learning rule that captures the movement of the weight vectors in a two-dimensional output space for convenient visual inspection. The extended training algorithm allows an intuitive analysis of the similarities inherent in the input data and, most important, an intuitive recognition of cluster boundaries.
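
For orientation, the basic self-organizing map that the abstract builds on can be sketched as follows. This is a minimal illustration of the standard Kohonen learning rule only. It does not implement the paper's extension for tracking weight-vector movement, which the abstract does not specify. The function names, the toy document vectors, the grid size, and the linearly decaying learning rate and neighborhood radius are all assumptions made for this example.

```python
import math
import random


def best_matching_unit(weights, x):
    """Return the grid position whose weight vector is closest to x (squared Euclidean)."""
    return min(
        weights,
        key=lambda pos: sum((w - v) ** 2 for w, v in zip(weights[pos], x)),
    )


def train_som(vectors, rows=3, cols=3, epochs=50, lr0=0.5, radius0=1.5, seed=0):
    """Train a basic SOM and return its weights keyed by (row, col) grid position.

    The linear decay of learning rate and radius is an illustrative choice,
    not a schedule taken from the paper.
    """
    rng = random.Random(seed)
    dim = len(vectors[0])
    # Random initial weight vector for every unit on the 2-D grid.
    weights = {
        (r, c): [rng.random() for _ in range(dim)]
        for r in range(rows)
        for c in range(cols)
    }
    total_steps = epochs * len(vectors)
    step = 0
    for _ in range(epochs):
        # Present the documents in a fresh random order each epoch.
        for x in rng.sample(vectors, len(vectors)):
            frac = step / total_steps
            lr = lr0 * (1.0 - frac)
            radius = max(radius0 * (1.0 - frac), 0.5)
            winner = best_matching_unit(weights, x)
            for pos, w in weights.items():
                # Gaussian neighborhood around the winner, measured on the grid.
                grid_d2 = (pos[0] - winner[0]) ** 2 + (pos[1] - winner[1]) ** 2
                h = math.exp(-grid_d2 / (2.0 * radius * radius))
                # Move the unit's weight vector toward the input.
                for i in range(dim):
                    w[i] += lr * h * (x[i] - w[i])
            step += 1
    return weights


# Hypothetical term-weight vectors for four documents over a 4-term vocabulary.
docs = [
    [1.0, 1.0, 0.0, 0.0],
    [1.0, 0.9, 0.0, 0.1],
    [0.0, 0.0, 1.0, 1.0],
    [0.1, 0.0, 0.9, 1.0],
]

som = train_som(docs)
# Map each document index to the grid unit that represents it best.
mapping = {i: best_matching_unit(som, d) for i, d in enumerate(docs)}
print(mapping)
```

Documents that land on the same or neighboring grid units are the ones the map treats as similar. In the plain algorithm sketched here, cluster boundaries are not directly visible on the grid, which is the shortcoming the abstract says the extended learning rule is meant to address.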
