Local peculiarity oriented data mining and its application in outlier detection

作者: JIAN YANG , NING ZHONG , YIYU YAO , JUE WANG

DOI: 10.1142/S0219622012500319

关键词:

摘要: Peculiarity oriented mining (POM), aimed at discovering peculiarity rules hidden in a dataset, is data method. factor (PF) one of the most important concepts POM. In this paper, it proved that PF can accurately characterize sampled from normal distribution. However, for general one-dimensional distribution, does not have property. A local version PF, called LPF, proposed to solve difficulty. LPF effectively describe continuous Based on framework presented, which consists two steps, namely, peculiar identification and analysis. Two algorithms case study analysis are given make practical. Experiments several benchmark datasets show their good performance.

参考文章(45)
Naoki Abe, Bianca Zadrozny, John Langford, Outlier detection by active learning knowledge discovery and data mining. pp. 504- 509 ,(2006) , 10.1145/1150402.1150459
Jian Yang, Ning Zhong, Yiyu Yao, Jue Wang, Local peculiarity factor and its application in outlier detection Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD 08. pp. 776- 784 ,(2008) , 10.1145/1401890.1401983
Muneaki Ohshima, Ning Zhong, YiYu Yao, Chunnian Liu, Relational peculiarity-oriented mining Data Mining and Knowledge Discovery. ,vol. 15, pp. 249- 273 ,(2007) , 10.1007/S10618-006-0046-6
Bin Shen, Min Yao, Zhaohui Wu, Yunjun Gao, Mining dynamic association rules with comments Knowledge and Information Systems. ,vol. 23, pp. 73- 98 ,(2010) , 10.1007/S10115-009-0207-1
Zengyou He, Xiaofei Xu, Shengchun Deng, Discovering cluster-based local outliers Pattern Recognition Letters. ,vol. 24, pp. 1641- 1650 ,(2003) , 10.1016/S0167-8655(03)00003-5
Yiyu Yao, Fei-Yue Wang, Jue Wang, Daniel Zeng, Rule+exception strategies for security information analysis IEEE Intelligent Systems. ,vol. 20, pp. 52- 57 ,(2005) , 10.1109/MIS.2005.93
Andrew McCallum, Kamal Nigam, Lyle H. Ungar, Efficient clustering of high-dimensional data sets with application to reference matching knowledge discovery and data mining. pp. 169- 178 ,(2000) , 10.1145/347090.347123
JUNLIN ZHOU, ALEKSANDAR LAZAREVIC, KUO-WEI HSU, JAIDEEP SRIVASTAVA, YAN FU, YUE WU, UNSUPERVISED LEARNING BASED DISTRIBUTED DETECTION OF GLOBAL ANOMALIES International Journal of Information Technology and Decision Making. ,vol. 9, pp. 935- 957 ,(2010) , 10.1142/S0219622010004172
Yong Shi, Li Zhang, COID: A cluster–outlier iterative detection approach to multi-dimensional data analysis Knowledge and Information Systems. ,vol. 28, pp. 709- 733 ,(2011) , 10.1007/S10115-010-0323-Y
Nitesh V. Chawla, Nathalie Japkowicz, Aleksander Kotcz, Editorial ACM SIGKDD Explorations Newsletter. ,vol. 6, pp. 1- 6 ,(2004) , 10.1145/1007730.1007733