Outlier detection by active learning

作者: Naoki Abe , Bianca Zadrozny , John Langford

DOI: 10.1145/1150402.1150459

关键词:

摘要: Most existing approaches to outlier detection are based on density estimation methods. There two notable issues with these methods: one is the lack of explanation for flagging decisions, and other relatively high computational requirement. In this paper, we present a novel approach classification, in an attempt address both issues. Our isbased key ideas. First, simple reduction via procedure that involves applying classification labeled data set containing artificially generated examples play role potential outliers. Once task has been reduced then invoke selective sampling mechanism active learning problem. We empirically evaluate proposed using number sets, find our method superior methods same but standard also show it competitive state-of-the-art literature estimation, while significantly improving complexity explanatory power.

参考文章(16)
Raymond T. Ng, Edwin M. Knorr, Algorithms for Mining Distance-Based Outliers in Large Datasets very large data bases. pp. 392- 403 ,(1998)
Hiroshi Mamitsuka, Naoki Abe, Efficient Mining from Large Databases by Query Learning international conference on machine learning. pp. 575- 582 ,(2000)
Yoav Freund, Robert E Schapire, A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting conference on learning theory. ,vol. 55, pp. 119- 139 ,(1997) , 10.1006/JCSS.1997.1504
Shai Ben-David, Michael Lindenbaum, Learning distributions by their density levels: a paradigm for learning without a teacher conference on learning theory. ,vol. 55, pp. 171- 182 ,(1997) , 10.1006/JCSS.1997.1507
James P. Theiler, D. M. Cai, Resampling approach for anomaly detection in multispectral images Algorithms and Technologies for Multispectral, Hyperspectral, and Ultraspectral Imagery IX. ,vol. 5093, pp. 230- 240 ,(2003) , 10.1117/12.487069
Hiroshi Mamitsuka, Naoki Abe, Query Learning Strategies Using Boosting and Bagging international conference on machine learning. pp. 1- 9 ,(1998)
Aleksandar Lazarevic, Vipin Kumar, Feature bagging for outlier detection knowledge discovery and data mining. pp. 157- 166 ,(2005) , 10.1145/1081870.1081891
Charu C. Aggarwal, Philip S. Yu, Outlier detection for high dimensional data international conference on management of data. ,vol. 30, pp. 37- 46 ,(2001) , 10.1145/375663.375668
H. S. Seung, M. Opper, H. Sompolinsky, Query by committee conference on learning theory. pp. 287- 294 ,(1992) , 10.1145/130385.130417
Prem Melville, Raymond J. Mooney, Diverse ensembles for active learning Twenty-first international conference on Machine learning - ICML '04. pp. 74- ,(2004) , 10.1145/1015330.1015385