Distance-based feature selection from probabilistic data

Authors: Tingting Zhao, Bin Pei, Suyun Zhao, Hong Chen, Cuiping Li

DOI: 10.1007/978-3-642-38562-9_29

Keywords: Randomness; Dimensionality reduction; Data mining; Computer science; Feature (computer vision); Noise (video); Feature selection; Divergence-from-randomness model; Machine learning; Probabilistic logic; Probabilistic analysis of algorithms; Artificial intelligence

Abstract: Feature selection is a powerful tool for dimensionality reduction of datasets. In the last decade, more and more researchers have paid attention to feature selection. Further, some have begun to focus on feature selection from probabilistic data. However, in the existing methods for probabilistic data, the distance hidden in the data is neglected. In this paper, we design a new distance measure to select informative features from probabilistic databases, in which both the distance and the randomness are considered. We then propose a feature selection algorithm based on this measure and develop two accelerative algorithms to boost the computation. Furthermore, we introduce a parameter into the measure to reduce its sensitivity to noise. Finally, experimental results verify the effectiveness of our algorithms.

References (8)
Ramakrishnan Srikant, Rakesh Agrawal. Mining Generalized Association Rules. Very Large Data Bases, pp. 407-419 (1995).
Mark Andrew Hall. Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning. International Conference on Machine Learning, pp. 359-366 (2000).
Smith Tsang, Ben Kao, Kevin Y. Yip, Wai-Shing Ho, Sau Dan Lee. Decision Trees for Uncertain Data. IEEE Transactions on Knowledge and Data Engineering, vol. 23, pp. 64-78 (2011). DOI: 10.1109/TKDE.2009.175
Wang Ngai, Ben Kao, Chun Chui, Reynold Cheng, Michael Chau, Kevin Yip. Efficient Clustering of Uncertain Data. International Conference on Data Mining, pp. 436-445 (2006). DOI: 10.1109/ICDM.2006.63
A.D. Sarma, O. Benjelloun, A. Halevy, J. Widom. Working Models for Uncertain Data. International Conference on Data Engineering, pp. 7-7 (2006). DOI: 10.1109/ICDE.2006.174
C.C. Aggarwal, P.S. Yu. A Survey of Uncertain Data Algorithms and Applications. IEEE Transactions on Knowledge and Data Engineering, vol. 21, pp. 609-623 (2009). DOI: 10.1109/TKDE.2008.190
Jiangtao Ren, Sau Dan Lee, Xianlu Chen, Ben Kao, Reynold Cheng, David Cheung. Naive Bayes Classification of Uncertain Data. International Conference on Data Mining, pp. 944-949 (2009). DOI: 10.1109/ICDM.2009.90
Manoranjan Dash, Huan Liu. Consistency-based search in feature selection. Artificial Intelligence, vol. 151, pp. 155-176 (2003). DOI: 10.1016/S0004-3702(03)00079-1