Optimal personalized treatment learning models with insurance applications

作者: Leo Guelman

DOI:

关键词:

摘要: In many important settings, subjects can show significant heterogeneity in response to a stimulus or “treatment". For instance, treatment that works for the overall population might be highly inefiective, even harmful, subgroup of with specific characteristics. Similarly, new may not better than an existing population, but there is likely who would benefit from it. The notion “one size fit all" becoming increasingly recognized wide variety fields, ranging economics medicine. This has drawn attention personalize choice treatment, so it optimal each individual. An personalized one maximizes probability desirable outcome. We call task learning (PTL). From statistical perspective, building PTL models imposes challenges, primarily because unknown on given training data set. this thesis, we formalize problem causal inference perspective and provide comprehensive description methods solve problem. contribute literature by proposing two novel methods, namely uplift random forests conditional forests. Our proposal outperforms based extensive numerical simulation real-world data. Next, introduce concept insurance marketing pricing applications. particular, Insurance these areas optimize client retention cross-selling experimental also illustrate application price-elasticity estimation economic price optimization context observational field, selection requires consideration expected losses individual policyholder within portfolio. non-life ratemaking gradient boosting estimate loss cost, key advantages over conventional generalized linear model approach. A facing research been lack publicly available software models. implement most fitting models, as well our proposed ones, package named uplift, which now released freely CRAN (Comprehensive R Archive Network) repository under computing environment.

参考文章(150)
Paul R. Rosenbaum, A Characterization of Optimal Designs for Observational Studies Journal of the royal statistical society series b-methodological. ,vol. 53, pp. 597- 610 ,(1991) , 10.1111/J.2517-6161.1991.TB01848.X
A. P. Dawid, Conditional Independence in Statistical Theory Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 41, pp. 1- 15 ,(1979) , 10.1111/J.2517-6161.1979.TB01052.X
Salvatore J. Stolfo, Philip K. Chan, Toward scalable learning with non-uniform class and cost distributions: a case study in credit card fraud detection knowledge discovery and data mining. pp. 164- 168 ,(1998)
Richard A. Levine, Xiaogang Su, Joseph Kang, Juanjuan Fan, Xin Yan, Facilitating score and causal inference trees for large observational studies Journal of Machine Learning Research. ,vol. 13, pp. 2955- 2994 ,(2012) , 10.5555/2503308.2503336
Leo Guelman, Montserrat Guillén, Ana M Pérez-Marín, None, Random Forests for Uplift Modeling: An Insurance Customer Retention Case International Conference on Modeling and Simulation in Engineering, Economics and Management. pp. 123- 133 ,(2012) , 10.1007/978-3-642-30433-0_13
Scott A. Neslin, Robert C. Blattberg, Pyŏng-do Kim, Database Marketing: Analyzing and Managing Customers ,(2008)
Paul R. Rosenbaum, Design of Observational Studies ,(2010)
Robert Tibshirani, Trevor Hastie, Jerome H. Friedman, The Elements of Statistical Learning ,(2001)