Theoretical and practical considerations of uncertainty and complexity in automated knowledge acquisition

作者: X.-J.M. Zhou , T.S. Dillon

DOI: 10.1109/69.469826

关键词:

摘要: Inductive machine learning has become an important approach to automated knowledge acquisition from databases. The disjunctive normal form (DNF), as the common analytic representation of decision trees and tables (rules), provides a basis for formal analysis uncertainty complexity in inductive learning. A theory general is developed based on C. Shannon's (1949) expansion discrete DNF, probabilistic induction system PIK further extracting real world data. Then we combine practical approaches study how data characteristics affect Three characteristics, namely, disjunctiveness, noise incompleteness, are studied. combination leveled pruning, condensing resampling estimation turns out be very powerful method dealing with highly inadequate Finally compared other recent systems number domains. >

参考文章(20)
Tim Niblett, Constructing Decision Trees in Noisy Domains. EWSL. pp. 67- 78 ,(1987)
Giulia Pagallo, Learning DNF by decision trees international joint conference on artificial intelligence. pp. 639- 644 ,(1989)
K. A. Horn, P. J. Compton, L. Lazarus, J. R. Quinlan, Inductive knowledge acquisition: a case study Proceedings of the Second Australian Conference on Applications of expert systems. pp. 137- 156 ,(1987)
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)
Peter Clark, Robin Boswell, Rule induction with CN2: Some recent improvements Lecture Notes in Computer Science. pp. 151- 163 ,(1991) , 10.1007/BFB0017011
Qing Ren Wang, Ching Y. Suen, Analysis and Design of a Decision Tree Based on Entropy Reduction and Its Application to Large Character Set Recognition IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. PAMI-6, pp. 406- 417 ,(1984) , 10.1109/TPAMI.1984.4767546
Barry H. Margolin, Richard J. Light, An Analysis of Variance for Categorical Data, II: Small Sample Comparisons with Chi Square and other Competitors Journal of the American Statistical Association. ,vol. 69, pp. 755- 764 ,(1974) , 10.1080/01621459.1974.10480201
Wray Buntine, Inductive knowledge acquisition and induction methodologies Knowledge Based Systems. ,vol. 2, pp. 52- 61 ,(1989) , 10.1016/0950-7051(89)90008-7