Evaluation of k-Nearest Neighbor classifier performance for direct marketing

作者: M. Govindarajan , RM. Chandrasekaran

DOI: 10.1016/J.ESWA.2009.04.055

关键词: Cross-validationPrecision and recallSupervised learningData miningPattern recognitionExploratory data analysisk-nearest neighbors algorithmClassifier (UML)Artificial intelligenceComputer scienceDirect marketing

摘要: Text data mining is a process of exploratory analysis. Classification maps into predefined groups or classes. It often referred to as supervised learning because the classes are determined before examining data. This paper describes proposed k-Nearest Neighbor classifier that performs comparative cross-validation for existing classifier. The feasibility and benefits approach demonstrated by means problem: direct marketing. Direct marketing has become an important application field mining. Comparative involves estimation accuracy either stratified k-fold equivalent repeated random subsampling. While method may have high bias; its performance (accuracy in our case) be poor due variance. Thus with was less than classifier, smaller improvement runtime larger precision recall. In we classification prediction where comparatively high.

参考文章(26)
Connie L. Bauer, A direct mail customer purchase model Journal of Direct Marketing. ,vol. 2, pp. 16- 24 ,(1988) , 10.1002/DIR.4000020305
Laurenz Wiskott, J-M Fellous, Norbert Kruger, Christopher von der Malsburg, Face recognition by elastic bunch graph matching IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 19, pp. 775- 779 ,(1997) , 10.1109/34.598235
S TAN, An effective refinement strategy for KNN text classifier Expert Systems With Applications. ,vol. 30, pp. 290- 298 ,(2006) , 10.1016/J.ESWA.2005.07.019
T. Cover, P. Hart, Nearest neighbor pattern classification IEEE Transactions on Information Theory. ,vol. 13, pp. 21- 27 ,(1967) , 10.1109/TIT.1967.1053964
Hyoung-joo Lee, Sungzoon Cho, Focusing on non-respondents: Response modeling with novelty detectors Expert Systems With Applications. ,vol. 33, pp. 522- 530 ,(2007) , 10.1016/J.ESWA.2006.05.016
J.M. Sousa, U. Kaymak, S. Madeira, A comparative study of fuzzy target selection methods in direct marketing ieee international conference on fuzzy systems. ,vol. 2, pp. 1251- 1256 ,(2002) , 10.1109/FUZZ.2002.1006683
Matthew Turk, Alex Pentland, Eigenfaces for recognition Journal of Cognitive Neuroscience. ,vol. 3, pp. 71- 86 ,(1991) , 10.1162/JOCN.1991.3.1.71
Micheline Kamber, Jiawei Han, Jian Pei, Data Mining: Concepts and Techniques ,(2000)
Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, Richard Harshman, Indexing by Latent Semantic Analysis Journal of the Association for Information Science and Technology. ,vol. 41, pp. 391- 407 ,(1990) , 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9
H. Samet, Depth-first k-nearest neighbor finding using the MaxNearestDist estimator international conference on image analysis and processing. pp. 486- 491 ,(2003) , 10.1109/ICIAP.2003.1234097