Collapsing Bandits and Their Application to Public Health Interventions

作者: Andrew Perrault , Haifeng Xu , Milind Tambe , Jackson A. Killian , Aditya Mate

DOI:

关键词:

摘要: We propose and study Collpasing Bandits, a new restless multi-armed bandit (RMAB) setting in which each arm follows binary-state Markovian process with special structure: when an is played, the state fully observed, thus "collapsing" any uncertainty, but passive, no observation made, allowing uncertainty to evolve. The goal keep as many arms "good" possible by planning limited budget of actions per round. Such Collapsing Bandits are natural models for healthcare domains workers must simultaneously monitor patients deliver interventions way that maximizes health their patient cohort. Our main contributions follows: (i) Building on Whittle index technique RMABs, we derive conditions under problem indexable. derivation hinges novel characterize optimal policies may take form either "forward" or "reverse" threshold policies. (ii) exploit optimality build fast algorithms computing index, including closed-form. (iii) evaluate our algorithm several data distributions from real-world task worker maximize patients' adherence tuberculosis medication. achieves 3-order-of-magnitude speedup compared state-of-the-art RMAB techniques while achieving similar performance.

参考文章(29)
M. O'Keeffe, P. S. Ansell, K. D. Glazebrook, J. Ni�o-Mora, Whittle's index policy for a multi-class queueing system with convex holding costs Mathematical Methods of Operations Research. ,vol. 57, pp. 21- 39 ,(2003) , 10.1007/S001860200257
Richard D. Smallwood, Edward J. Sondik, The Optimal Control of Partially Observable Markov Processes over a Finite Horizon Operations Research. ,vol. 21, pp. 1071- 1088 ,(1973) , 10.1287/OPRE.21.5.1071
Sonjia Kenya, Jamal Jones, Kristopher Arheart, Erin Kobetz, Natasha Chida, Shelly Baer, Alexis Powell, Stephen Symes, Tai Hunte, Anne Monroe, Olveen Carrasquillo, Using Community Health Workers to Improve Clinical Outcomes Among People Living with HIV: A Randomized Controlled Trial Aids and Behavior. ,vol. 17, pp. 2927- 2934 ,(2013) , 10.1007/S10461-013-0440-1
Richard R. Weber, Gideon Weiss, ON AN INDEX POLICY FOR RESTLESS BANDITS Journal of Applied Probability. ,vol. 27, pp. 637- 648 ,(1990) , 10.2307/3214547
K. D. Glazebrook, D. Ruiz-Hernandez, C. Kirkbride, Some indexable families of restless bandit problems Advances in Applied Probability. ,vol. 38, pp. 643- 672 ,(2006) , 10.1239/AAP/1158684996
Bernd L??we, J??rgen Un??tzer, Christopher M. Callahan, Anthony J. Perkins, Kurt Kroenke, Monitoring depression treatment outcomes with the Patient Health Questionnaire-9 Medical Care. ,vol. 42, pp. 1194- 1201 ,(2004) , 10.1097/00005650-200412000-00006