Collapsing Bandits and Their Application to Public Health Interventions

Authors: Andrew Perrault, Haifeng Xu, Milind Tambe, Jackson A. Killian, Aditya Mate

Abstract: We propose and study Collapsing Bandits, a new restless multi-armed bandit (RMAB) setting in which each arm follows a binary-state Markovian process with a special structure: when an arm is played, the state is fully observed, thus "collapsing" any uncertainty, but when an arm is passive, no observation is made, thus allowing uncertainty to evolve. The goal is to keep as many arms in the "good" state as possible by planning a limited budget of actions per round. Such Collapsing Bandits are natural models for many healthcare domains in which workers must simultaneously monitor patients and deliver interventions in a way that maximizes the health of their patient cohort. Our main contributions are as follows: (i) Building on the Whittle index technique for RMABs, we derive conditions under which the Collapsing Bandits problem is indexable. Our derivation hinges on novel conditions that characterize when the optimal policies may take the form of either "forward" or "reverse" threshold policies. (ii) We exploit the optimality of threshold policies to build fast algorithms for computing the Whittle index, including a closed form. (iii) We evaluate our algorithm on several data distributions, including data from a real-world healthcare task in which a worker must monitor and deliver interventions to maximize their patients' adherence to tuberculosis medication. Our algorithm achieves a 3-order-of-magnitude speedup compared to state-of-the-art RMAB techniques while achieving similar performance.
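The "collapsing" structure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm and does not compute the Whittle index; it only tracks the belief that a single arm is in the "good" state. The function names and the transition probabilities `p_good_stay` and `p_bad_to_good` are hypothetical, chosen for illustration, and the sketch assumes a passive arm follows a fixed two-state Markov chain while ignoring any effect the intervention itself has on the transition.

```python
def passive_belief_update(belief, p_good_stay, p_bad_to_good):
    """Advance the belief P(arm is good) by one round with no observation.

    p_good_stay and p_bad_to_good are hypothetical passive transition
    probabilities (good -> good and bad -> good), not values from the paper.
    """
    return belief * p_good_stay + (1.0 - belief) * p_bad_to_good


def collapse(observed_good):
    """Playing the arm reveals its state, so the belief collapses to 0 or 1."""
    return 1.0 if observed_good else 0.0


def belief_after_passive_rounds(observed_good, rounds, p_good_stay, p_bad_to_good):
    """Belief after an observation followed by `rounds` unobserved rounds."""
    belief = collapse(observed_good)
    for _ in range(rounds):
        belief = passive_belief_update(belief, p_good_stay, p_bad_to_good)
    return belief


if __name__ == "__main__":
    # Uncertainty grows as the arm stays passive after being seen in the good state.
    for k in range(4):
        print(k, belief_after_passive_rounds(True, k, 0.9, 0.3))
```

Because each observation resets the belief to 0 or 1, an arm's belief in this sketch is fully determined by the last observed state and the number of rounds since it was played.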


