On clustering tree structured data with categorical nature

作者: B. Boutsinas , T. Papastergiou

DOI: 10.1016/J.PATCOG.2008.05.023

关键词:

摘要: Clustering consists in partitioning a set of objects into disjoint and homogeneous clusters. For many years, clustering methods have been applied wide variety disciplines they also utilized scientific areas. Traditionally, deal with numerical data, i.e. represented by conjunction attribute values. However, nowadays commercial or databases usually contain categorical attributes. In this paper we present dissimilarity measure which is capable to tree structured data. Thus, it can be used for extending the various versions very popular k-means algorithm such We discuss how an extension achieved. Moreover, empirically prove that proposed accurate, compared other well-known (dis)similarity measures

参考文章(48)
Nick Koudas, Beng Chin Ooi, Suresh Venkatasubramanian, Divesh Srivastava, Bing Tian Dai, Column heterogeneity as a measure of data quality CleanDB. pp. 1- ,(2006)
Valentina Tamma, Floriana Esposito, Donato Malerba, Vincenzo Gioviale, Comparing Dissimilarity Measures for Symbolic Data Analysis ,(2001)
Analysis of Symbolic Data Springer Berlin Heidelberg. ,(2000) , 10.1007/978-3-642-57155-8
Mounira Harzallah, Emmanuel Blanchard, Pascale Kuntz, Henri Briand, A Typology Of Ontology-Based Semantic Measures. EMOI-INTEROP. ,(2005)
Richard Dubes, A.K. Jain, Clustering Methodologies in Exploratory Data Analysis Advances in Computers. ,vol. 19, pp. 113- 228 ,(1980) , 10.1016/S0065-2458(08)60034-0
Ryszard S. Michalski, Robert E. Stepp, Learning from Observation: Conceptual Clustering Machine Learning. pp. 331- 363 ,(1983) , 10.1007/978-3-662-12405-5_11
Martin Volk, H Oxhammar, M Warin, Enriching an ontology with WordNet based on similarity measures Warin, M; Oxhammar, H; Volk, Martin (2005). Enriching an ontology with WordNet based on similarity measures. In: MEANING-2005 Workshop, Trento, 2005 - 2005.. ,(2005) , 10.5167/UZH-20381
J. G. Carbonell, T. M. Mitchell, R. S. Michalski, Machine Learning: An Artificial Intelligence Approach Springer Publishing Company, Incorporated. ,(2013)