作者: Sylvain Halle
关键词:
摘要: The accuracy of the rules produced by a concept learning system can be hindered presence errors in data, such as "ill-defined" attributes that are too general or specific for to learn. In this paper, we devise method uses Boolean differences computed program called Newton identify multiple ill-defined dataset single pass. is based on compound heuristic assigns real-valued rank each possible hypothesis its key characteristics. We show extensive empirical testing randomly generated classifiers with highest correct one an observed probability quickly converging 100%. Moreover, monotonicity function enables us use it rough estimator own likelihood.