CBCH (clustering-based convex hull) for reducing training time of support vector machine

作者: Pardis Birzhandi , Hee Yong Youn

DOI: 10.1007/S11227-019-02795-9

关键词:

摘要: Support vector machine (SVM) is an efficient learning technique widely applied to various classification problems due its robustness. However, the training time grows dramatically as number of data increases. As a result, applicability SVM large-scale datasets somewhat limited. In SVM, only few samples called support vectors (SVs) affect construction hyperplane. Therefore, removing irrelevant SVs does not degrade performance SVM. this paper clustering-based convex hull (CBCH) scheme introduced which allows efficiently remove insignificant and thereby reduce The CBCH initially applies k-mean clustering algorithm given points, then, each cluster obtained. Only vertices hulls points relevant are included points. Computer simulation over sizes types reveals that proposed considerably faster more accurate than existing classifiers. based on geometric interpretation applicable both linearly separable inseparable datasets.

参考文章(45)
Michal Kawulok, Jakub Nalepa, Support vector machines training data selection using a genetic algorithm SSPR'12/SPR'12 Proceedings of the 2012 Joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition. pp. 557- 565 ,(2012) , 10.1007/978-3-642-34166-3_61
Xiang-Jun Shen, Lei Mu, Zhen Li, Hao-Xiang Wu, Jian-Ping Gou, Xin Chen, Large-scale support vector machine classification with redundant data reduction Neurocomputing. ,vol. 172, pp. 189- 197 ,(2016) , 10.1016/J.NEUCOM.2014.10.102
Shu-yin Xia, Zhong-yang Xiong, Yue-guo Luo, Li-mei Dong, A method to improve support vector machine based on distance to hyperplane Optik. ,vol. 126, pp. 2405- 2410 ,(2015) , 10.1016/J.IJLEO.2015.06.010
Edgar Osuna, Osberth De Castro, Convex Hull in Feature Space for Support Vector Machines ibero american conference on ai. pp. 411- 419 ,(2002) , 10.1007/3-540-36131-6_42
Jair Cervantes, Xiaoou Li, Wen Yu, Support Vector Machine Classification Based on Fuzzy Clustering for Large Data Sets Lecture Notes in Computer Science. pp. 572- 582 ,(2006) , 10.1007/11925231_54
Vladimir Naumovich Vapnik, Estimation of Dependences Based on Empirical Data ,(2010)
Erin J. Bredensteiner, Kristin P. Bennett, Duality and Geometry in SVM Classifiers international conference on machine learning. pp. 57- 64 ,(2000)
Pritish Varadwaj, Neetesh Purohit, Bhumika Arora, Detection of Splice Sites Using Support Vector Machine Communications in Computer and Information Science. ,vol. 40, pp. 493- 502 ,(2009) , 10.1007/978-3-642-03547-0_47