作者: Edwin Aldana-Bobadilla , Ivan Lopez-Arevalo , Hiram Galeana-Zapien , Melesio Crespo-Sanchez
DOI: 10.1109/ACCESS.2018.2882408
关键词: Partition (database) 、 Artificial intelligence 、 Linear programming 、 Cluster analysis 、 Pattern recognition 、 Genetic algorithm 、 Computer science
摘要: Clustering is an important task in data analysis to find a partition on unlabeled dataset based similarity relationships among its elements. Typically, such determined by proximity measure or distance. Then, the optimal one that minimizes distance elements belonging same subset and maximizes from different subsets. The way which found called clustering method. adequateness of commonly terms validity index. In this paper, we propose method referred as quality-driven search for (QDSOC) where process directly driven index instead measure. Our approach allows efficiently exploring large solution space via breed genetic algorithm, so-called eclectic algorithm. Unlike existing methods, proposed QDSOC offers provides mathematical model representation membership functions. This describes points belong subsets found. Thus, using model, can predict new objects without performing again. As part experimental evaluation, our compared with k-means self-organizing maps (SOMs), are two well-known approaches. methods were used solve wide sample problems, three indices. From obtained results, demonstrate statistically outperforms SOMs. We also point out does not incur excessive computational overhead respect traditional methods.