Helping predictive analytics interpretation using regression trees and clustering perturbation

作者: Olivier Parisot , Yoanne Didry , Thomas Tamisier , Benoît Otjacques

DOI: 10.1080/12460125.2015.994331

关键词:

摘要: Regression trees are helpful tools for decision support and predictive analytics, due to their simple structure the ease with which they can be obtained from data. Nonetheless, when applied non-trivial datasets, tend grow according complexity of data, becoming difficult interpret. This difficulty overcome by clustering dataset representing regression tree each cluster independently. In order help create models that more comprehensible, we propose in this work a perturbation method reduce size cluster. A prototype has been developed tested on several datasets.

参考文章(42)
Stef van den Elzen, Jarke J. van Wijk, BaobabView: Interactive construction and analysis of decision trees visual analytics science and technology. pp. 151- 160 ,(2011) , 10.1109/VAST.2011.6102453
Niklas Lavesson, Hiva Allahyari, User-oriented Assessment of Classification Model Understandability scandinavian conference on ai. pp. 11- 19 ,(2011) , 10.3233/978-1-60750-754-3-11
Olivier Parisot, Pierrick Bruneau, Yoanne Didry, Thomas Tamisier, User-Driven Data Preprocessing for Decision Support cooperative design visualization and engineering. pp. 81- 84 ,(2013) , 10.1007/978-3-642-40840-3_13
Veena H. Bhat, Prasanth G. Rao, S. Krishna, P. Deepa Shenoy, K. R. Venugopal, L. M. Patnaik, An Efficient Framework for Prediction in Healthcare Data Using Soft Computing Techniques advances in computing and communications. pp. 522- 532 ,(2011) , 10.1007/978-3-642-22720-2_55
Mark A. Hall, Ian H. Witten, Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques ,(1999)
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)
T. Elomaa, M. Kaariainen, An analysis of reduced error pruning Journal of Artificial Intelligence Research. ,vol. 15, pp. 163- 187 ,(2001) , 10.1613/JAIR.816
Sreerama K. Murthy, Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey Data Mining and Knowledge Discovery. ,vol. 2, pp. 345- 389 ,(1998) , 10.1023/A:1009744630224
Gregor Stiglic, Simon Kocbek, Igor Pernek, Peter Kokol, Comprehensive Decision Tree Models in Bioinformatics PLoS ONE. ,vol. 7, pp. e33812- 13 ,(2012) , 10.1371/JOURNAL.PONE.0033812