KIDBSCAN: A New Efficient Data Clustering Algorithm

作者: Cheng-Fa Tsai , Chih-Wei Liu

DOI: 10.1007/11785231_73

关键词:

摘要: Spatial data clustering plays an important role in numerous fields. Data algorithms have been developed recent years. K-means is fast, easily implemented and finds most local optima. IDBSCAN more efficient than DBSCAN. can also find arbitrary shapes detect noisy points for clustering. This investigation presents a new technique based on the concept of IDBSCAN, which used to high-density center then expand clusters from these points. has lower execution time because it reduces by selecting representative seeds. The simulation indicates that proposed KIDBSCAN yields accurate results. Additionally, this approach I/O cost. outperforms DBSCAN IDBSCAN.

参考文章(12)
Richard R. Muntz, Jiong Yang, Wei Wang, STING: A Statistical Information Grid Approach to Spatial Data Mining very large data bases. pp. 186- 195 ,(1997)
Hans-Peter Kriegel, Martin Ester, Jörg Sander, Xiaowei Xu, A density-based algorithm for discovering clusters in large spatial Databases with Noise knowledge discovery and data mining. pp. 226- 231 ,(1996)
Sudipto Guha, Rajeev Rastogi, Kyuseok Shim, Rock: A robust clustering algorithm for categorical attributes Information Systems. ,vol. 25, pp. 345- 366 ,(2000) , 10.1016/S0306-4379(00)00022-3
Tian Zhang, Raghu Ramakrishnan, Miron Livny, BIRCH: an efficient data clustering method for very large databases international conference on management of data. ,vol. 25, pp. 103- 114 ,(1996) , 10.1145/233269.233324
B. Borah, D.K. Bhattacharyya, An improved sampling-based DBSCAN for large spatial databases international conference on intelligent sensing and information processing. pp. 92- 96 ,(2004) , 10.1109/ICISIP.2004.1287631
Wei Wang, Jiong Yang, R. Muntz, STING+: an approach to active spatial data mining international conference on data engineering. pp. 116- 125 ,(1999) , 10.1109/ICDE.1999.754914
J. B. Macqueen, Some methods for classification and analysis of multivariate observations Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Statistics. ,vol. 1, pp. 281- 297 ,(1967)
Sudipto Guha, Rajeev Rastogi, Kyuseok Shim, CURE Proceedings of the 1998 ACM SIGMOD international conference on Management of data - SIGMOD '98. ,vol. 27, pp. 73- 84 ,(1998) , 10.1145/276304.276312
G. Karypis, Eui-Hong Han, V. Kumar, Chameleon: hierarchical clustering using dynamic modeling Computer. ,vol. 32, pp. 68- 75 ,(1999) , 10.1109/2.781637
R. Xu, D. WunschII, Survey of clustering algorithms IEEE Transactions on Neural Networks. ,vol. 16, pp. 645- 678 ,(2005) , 10.1109/TNN.2005.845141