A Linear Algebra Approach to C-Means Clustering Using GPUs and MPI

作者: Apostolos Glenis , Vu Pham

DOI: 10.1109/PCI.2012.24

关键词:

摘要: The fuzzy c-means clustering is a well-known unsupervised algorithm and has been widely used in various pattern recognition applications. As the amount of data increase, however, basic serial implementation becomes overwhelmed. This main motivation for utilizing computational power parallel machines to speed up algorithm. We present an that exploits mathematical equations create building blocks based on linear algebra functions are optimized most available architectures. implemented our both GPU (using CUDA CUBLAS) MPI MPI4py NumPy), then evaluated their performance scalability. Experiments show outperforms all implementations have proposed so far.

参考文章(9)
Andrew D. Pangborn, Scalable data clustering using GPUs ,(2010)
Enrique H. Ruspini, A new approach to clustering Information & Computation. ,vol. 15, pp. 22- 32 ,(1969) , 10.1016/S0019-9958(69)90591-9
D.T. Anderson, R.H. Luke, J.M. Keller, Speedup of Fuzzy Clustering Through Stream Processing on Graphics Processing Units IEEE Transactions on Fuzzy Systems. ,vol. 16, pp. 1101- 1106 ,(2008) , 10.1109/TFUZZ.2008.924203
N.R. Pal, J.C. Bezdek, Complexity reduction for "large image" processing systems man and cybernetics. ,vol. 32, pp. 598- 611 ,(2002) , 10.1109/TSMCB.2002.1033179
S. Eschrich, Jingwei Ke, L.O. Hall, D.B. Goldgof, Fast accurate fuzzy clustering through data reduction IEEE Transactions on Fuzzy Systems. ,vol. 11, pp. 262- 270 ,(2003) , 10.1109/TFUZZ.2003.809902
J.C. Bezdek, R.J. Hathaway, Progressive sampling schemes for approximate clustering in very large data sets ieee international conference on fuzzy systems. ,vol. 1, pp. 15- 21 ,(2004) , 10.1109/FUZZY.2004.1375677
A. Baraldi, P. Blonda, A survey of fuzzy clustering algorithms for pattern recognition. II systems man and cybernetics. ,vol. 29, pp. 778- 785 ,(1999) , 10.1109/3477.809032