Authors: Teuvo Kohonen, Samuel Kaski, Krista Lagus, Jarkko Salojärvi, Jukka Honkela
DOI: 10.1016/B978-044450270-4/50013-9
Keywords:
Abstract (publisher summary): This chapter discusses how, when the self-organizing map (SOM) is applied to the mapping of documents, one can represent them statistically by their weighted word frequency histograms, or by some reduced representations of the histograms, which can be regarded as data vectors. One SOM of about seven million documents has been made, viz., of all the patent abstracts in the world that have been written in English and are available in electronic form. The map consists of models. Keywords or key texts can be used to search for the most relevant documents first. New effective coding and computational schemes are described. The document organization, searching, and browsing system called WEBSOM is described in this chapter. The original WEBSOM was a two-level SOM architecture, but it was later simplified.