作者: Doktorska Disertacija
DOI:
关键词:
摘要: This dissertation investigates how to adapt standard classification rule learning approaches subgroup discovery. The goal of discovery is find rules describing subsets a selected population that are sufficiently large and statistically unusual in terms class distribution. presents algorithm, CN2-SD, developed by modifying parts the CN2 learner: its covering search heuristic, probabilistic instances, evaluation measures. Experimental CN2-SD on data sets shows substantial reduction number induced rules, increased coverage, significance overall coverage target concept as well slight improvements area under ROC curve, when compared with algorithms RIPPER. An application traffic accident set confirms these findings. also algorithm APRIORI-SD, adapting association was achieved building learner APRIORI-C, enhanced novel post–processing mechanism, new quality measure for (weighted relative accuracy) using instances. results similar behavior APRIORI-SD i.e. CN2, RIPPER APRIORI-C. A optimization approach based analysis presented implemented an adaptation algorithm. implications “number-of-rules–unusualness–coverage” trade off investigated through experimental adapted sets. form 2D graphs depicting dependencies between unusualness, accuracy original xi xii