Benchmarking Support Vector Machines Implementation Using Multiple Techniques

作者: M. V. Sukanya , Shiju Sathyadevan , U. B. Unmesha Sreeveni

DOI: 10.1007/978-3-319-11218-3_22

关键词:

摘要: Data management becomes a complex task when hundreds of petabytes data are being gathered, stored and processed on day to basis. Efficient processing the exponentially growing is inevitable in this context. This paper discusses about huge amount through Support Vector machine (SVM) algorithm using different techniques ranging from single node Linier implementation parallel distributed frameworks like Hadoop. Map-Reduce component Hadoop performs parallelization process which used feed information Machines (SVMs), learning applicable classification regression analysis. Paper also does detailed anatomy SVM sets roadmap for implementing same both linear fashion. The main objective explain detail steps involved developing an scratch standard conduct performance analysis across SVM, Hadoop, cluster against proven tool R, gauging them with respect accuracy achieved, their pace varying sizes, capability handle volume without breaking etc.

参考文章(6)
Hwanjo Yu, Jiong Yang, Jiawei Han, Classifying large data sets using SVMs with hierarchical clusters knowledge discovery and data mining. pp. 306- 315 ,(2003) , 10.1145/956750.956786
Cheng-tao Chu, Sang Kim, Yi-An Lin, YuanYuan Yu, Gary Bradski, Kunle Olukotun, Andrew Ng, None, Map-Reduce for Machine Learning on Multicore neural information processing systems. ,vol. 19, pp. 281- 288 ,(2006)
Jeffrey Dean, Sanjay Ghemawat, MapReduce: a flexible data processing tool Communications of The ACM. ,vol. 53, pp. 72- 77 ,(2010) , 10.1145/1629175.1629198
Jeffrey Dean, Sanjay Ghemawat, MapReduce Communications of the ACM. ,vol. 51, pp. 107- 113 ,(2008) , 10.1145/1327452.1327492