A Unified Framework for Outlier Detection in Trace Data Analysis

作者: Zhiguo Li , Robert J. Baseman , Yada Zhu , Fateh A. Tipu , Noam Slonim

DOI: 10.1109/TSM.2013.2267937

关键词:

摘要: Process trace data (PTD) is an important type in semiconductor manufacturing and has a very large aggregate volume. While mining statistical analysis play key role the quality control of wafers, existence outliers adversely affects applications benefiting from PTD analysis. Due to complexities resultant outlier patterns, this paper proposes unified detection framework which takes advantages complexity reduction using entropy abrupt change cumulative sum (CUSUM) method. To meet practical needs analysis, two-step algorithm taking into account related domain knowledge developed, its effectiveness validated by real sets production example. The experimental results show that proposed method outperforms Fast Greedy Algorithm (FGA) Grubb's test, two commonly used techniques for univariate data.

参考文章(17)
Micheline Kamber, Jiawei Han, Jian Pei, Data Mining: Concepts and Techniques ,(2000)
Douglas C. Montgomery, Introduction to Statistical Quality Control ,(1985)
Zengyou He, Xiaofei Xu, Joshua Zhexue Huang, Shengchun Deng, A fast greedy algorithm for outlier mining Lecture Notes in Computer Science. pp. 567- 576 ,(2006)
Shengchun Deng, Zengyou He, Xiaofei Xu, An Optimization Model for Outlier Detection in Categorical Data arXiv: Databases. ,(2005)
Prabhakar Raghavan, Andreas Arning, Rakesh Agrawal, A linear method for deviation detection in large databases knowledge discovery and data mining. pp. 164- 169 ,(1996)
Zengyou He, Shengchun Deng, Xiaofei Xu, An Optimization Model for Outlier Detection in Categorical Data Lecture Notes in Computer Science. pp. 400- 409 ,(2005) , 10.1007/11538059_42
Wilhelmine Stefansky, Rejecting Outliers in Factorial Designs Technometrics. ,vol. 14, pp. 469- 479 ,(1972) , 10.1080/00401706.1972.10488930
Frank E. Grubbs, Procedures for Detecting Outlying Observations in Samples Technometrics. ,vol. 11, pp. 1- 21 ,(1969) , 10.1080/00401706.1969.10490657