Classifier evaluation and attribute selection against active adversaries

作者: Murat Kantarcıoğlu , Bowei Xi , Chris Clifton

DOI: 10.1007/S10618-010-0197-3

关键词: Equilibrium pointData miningIntrusion detection systemPopulationFeature selectionAdversaryGame theoryFalse positive paradoxMachine learningClassifier (UML)Computer scienceArtificial intelligence

摘要: Many data mining applications, such as spam filtering and intrusion detection, are faced with active adversaries. In all these the future sets training set no longer from same population, due to transformations employed by Hence a main assumption for existing classification techniques holds initially successful classifiers degrade easily. This becomes game between adversary miner: The modifies its strategy avoid being detected current classifier; miner then updates classifier based on new threats. this paper, we investigate possibility of an equilibrium in seemingly never ending game, where neither party has incentive change. Modifying causes too many false positives little increase true positives; changes decrease utility negative items that not detected. We develop theoretic framework behavior adversarial applications can be analyzed, provide solutions finding point. A classifier's performance indicates eventual success or failure. could select attributes their performance, construct effective classifier. case study online lending demonstrates how apply proposed real application.

参考文章(24)
George Casella, Christian P. Robert, Monte Carlo Statistical Methods (Springer Texts in Statistics) Springer-Verlag New York, Inc.. ,(2005)
Calton Pu, Steve Webb, Observed Trends in Spam Construction Techniques: A Case Study of Spam Evolution. conference on email and anti-spam. ,(2006)
Thomas Vallée, Tamer Başar, Off-Line Computation of Stackelberg Solutions with the Genetic Algorithm Computing in Economics and Finance. ,vol. 13, pp. 201- 209 ,(1999) , 10.1023/A:1008652106422
David G. Stork, Richard O. Duda, Peter E. Hart, Pattern Classification (2nd ed.) ,(1999)
Tom Fawcett, Foster Provost, Adaptive Fraud Detection Data Mining and Knowledge Discovery. ,vol. 1, pp. 291- 316 ,(1997) , 10.1023/A:1009700419189
John C. Mitchell, Elizabeth Stinson, Towards systematic evaluation of the evadability of bot/botnet detection methods usenix security symposium. pp. 5- ,(2008)
Nicolo Cesa-Bianchi, Gabor Lugosi, Prediction, learning, and games ,(2006)
Geert Jan Olsder, Tamer Başar, Dynamic Noncooperative Game Theory ,(1982)
Keinosuke Fukunaga, Introduction to statistical pattern recognition (2nd ed.) Academic Press Professional, Inc.. ,(1990)