Interpretable Machine Learning -- A Brief History, State-of-the-Art and Challenges

作者: Bernd Bischl , Giuseppe Casalicchio , Christoph Molnar

DOI: 10.1007/978-3-030-65965-3_28

关键词:

摘要: We present a brief history of the field interpretable machine learning (IML), give an overview state-of-the-art interpretation methods and discuss challenges. Research in IML has boomed recent years. As young as is, it over 200 years old roots regression modeling rule-based learning, starting 1960s. Recently, many new have been proposed, them model-agnostic, but also techniques specific to deep tree-based ensembles. either directly analyze model components, study sensitivity input perturbations, or local global surrogate approximations ML model. The approaches state readiness stability, with not only proposed research, implemented open-source software. But important challenges remain for IML, such dealing dependent features, causal interpretation, uncertainty estimation, which need be resolved its successful application scientific problems. A further challenge is missing rigorous definition interpretability, accepted by community. To address advance field, we urge recall our interpretable, data-driven statistics (rule-based) ML, consider other areas analysis, inference, social sciences.

参考文章(104)
Pengfei Wei, Zhenzhou Lu, Jingwen Song, Variable importance analysis: A comprehensive review Reliability Engineering & System Safety. ,vol. 142, pp. 399- 432 ,(2015) , 10.1016/J.RESS.2015.05.018
Johannes Fürnkranz, Dragan Gamberger, Nada Lavrač, Foundations of Rule Learning Cognitive Technologies. pp. 353- 353 ,(2012) , 10.1007/978-3-540-75197-7
Lloyd S. Shapley, A Value for n-person Games Contributions to the Theory of Games. pp. 307- 317 ,(1952) , 10.1017/CBO9780511528446.003
Jerome H. Friedman, Greedy function approximation: A gradient boosting machine. Annals of Statistics. ,vol. 29, pp. 1189- 1232 ,(2001) , 10.1214/AOS/1013203451
Matthew D. Zeiler, Rob Fergus, Visualizing and Understanding Convolutional Networks european conference on computer vision. pp. 818- 833 ,(2014) , 10.1007/978-3-319-10590-1_53
Carolin Strobl, Anne-Laure Boulesteix, Achim Zeileis, Torsten Hothorn, Bias in random forest variable importance measures: Illustrations, sources and a solution BMC Bioinformatics. ,vol. 8, pp. 25- 25 ,(2007) , 10.1186/1471-2105-8-25
Rich Caruana, Yin Lou, Johannes Gehrke, Paul Koch, Marc Sturm, Noemie Elhadad, Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission knowledge discovery and data mining. pp. 1721- 1730 ,(2015) , 10.1145/2783258.2788613