Using Support Vector Machines for Survey Research

作者: Antje Kirchner , Curtis S. Signorino

DOI: 10.29115/SP-2018-0001

关键词: Binary classificationLogistic regressionImputation (statistics)Record linkageComputer scienceSurvey researchSmall numberMachine learningArtificial intelligenceSupport vector machineCategorical variable

摘要: Recent developments in machine learning allow for flexible functional form estimation beyond the approaches typically used by survey researchers and social scientists. Support vector machines (SVMs) are one such technique, commonly binary classification problems, as whether or not an individual decides to participate a survey. Since their inception, SVMs have been extended solve categorical regression problems. Their versatility combination with fact that they perform well presence of large number predictors, even small cases, makes them very appealing wide range including character recognition text classification, speech speaker verification, imputation problems record linkage. In this article, we provide non-technical introduction main concepts SVMs, discuss advantages disadvantages, present ideas how can be research, and, finally, hands-on example, code, research results compare traditional logistic regression.

参考文章(13)
Kristin P. Bennett, Colin Campbell, Support vector machines ACM SIGKDD Explorations Newsletter. ,vol. 2, pp. 1- 13 ,(2000) , 10.1145/380995.380999
Peter Christen, Automatic record linkage using seeded nearest neighbour and support vector machine classification Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD 08. pp. 151- 159 ,(2008) , 10.1145/1401890.1401913
David L. Olson, Dursun Delen, Yanyan Meng, Comparative analysis of data mining methods for bankruptcy prediction decision support systems. ,vol. 52, pp. 464- 473 ,(2012) , 10.1016/J.DSS.2011.10.007
Bernhard E. Boser, Isabelle M. Guyon, Vladimir N. Vapnik, A training algorithm for optimal margin classifiers conference on learning theory. pp. 144- 152 ,(1992) , 10.1145/130385.130401
A. Malyscheff, T. Trafalis, Support vector machines and the electoral college international joint conference on neural network. ,vol. 3, pp. 2344- 2348 ,(2003) , 10.1109/IJCNN.2003.1223778
Chao Lu, Xue-wei Li, Hong-bo Pan, Application of SVM and Fuzzy Set Theory for Classifying with Incomplete Survey Data international conference on service systems and service management. pp. 1- 4 ,(2007) , 10.1109/ICSSSM.2007.4280164
Chih-Chung Chang, Chih-Jen Lin, LIBSVM ACM Transactions on Intelligent Systems and Technology. ,vol. 2, pp. 1- 27 ,(2011) , 10.1145/1961189.1961199
Dapeng Cui, David Curry, Prediction in Marketing Using the Support Vector Machine Marketing Science. ,vol. 24, pp. 595- 615 ,(2005) , 10.1287/MKSC.1050.0123