A Multiarmed Bandit Approach to Adaptive Water Quality Management.

作者: David M Martin , Fred A Johnson

DOI: 10.1002/IEAM.4302

关键词: Water quality managementProbability matchingConservationSample (statistics)Probability of successBayesian probabilityContext (language use)Adaptive managementComputer scienceOperations research

摘要: Nonpoint source water quality management is challenged with allocating uncertain actions and monitoring their performance in the absence of state-dependent decision making. This adaptive context can be expressed as a multiarmed bandit problem. Multiarmed strategies attempt to balance exploitation that appear maximize exploration uncertain, but potentially better, actions. We performed test inform Massachusetts, USA. Conservation restoration practitioners were tasked household wastewater treatments minimize N inputs impaired waters. obtained time series data from 3 treatment types organized them chronologically randomly. The chronological set represented nonstationary based on recent data, whereas random stationary performance. tested 2 hypothetical experiments sample through 20 sequential decisions. A deterministic probability-matching strategy allocated highest probability success regarding at each decision. randomized randomly according compared nonadaptive equally Results indicated equal allocation useful for learning situations tended overexplore inferior thus did not when other strategies. Deterministic matching maximized many situations, adequately explore converged situations. Randomized balanced could converge These findings provide evidence are management. Integr Environ Assess Manag 2020;16:841-852. © 2020 Authors. Integrated Environmental Assessment Management published by Wiley Periodicals LLC behalf Society Toxicology & Chemistry (SETAC).

参考文章(22)
James Wason, Jack Bowden, Sofía S. Villar, Multi-armed Bandit Models for the Optimal Design of Clinical Trials: Benefits and Challenges. Statistical Science. ,vol. 30, pp. 199- 215 ,(2015) , 10.1214/14-STS504
Lance Gunderson, Stephen S. Light, Adaptive management and adaptive governance in the everglades ecosystem Policy Sciences. ,vol. 39, pp. 323- 334 ,(2007) , 10.1007/S11077-006-9027-2
H. Robbins, A SEQUENTIAL DECISION PROBLEM WITH A FINITE MEMORY Proceedings of the National Academy of Sciences. ,vol. 42, pp. 920- 923 ,(1956) , 10.1073/PNAS.42.12.920
Herbert Robbins, Some aspects of the sequential design of experiments Bulletin of the American Mathematical Society. ,vol. 58, pp. 527- 535 ,(1952) , 10.1090/S0002-9904-1952-09620-8
T.L Lai, Herbert Robbins, Asymptotically efficient adaptive allocation rules Advances in Applied Mathematics. ,vol. 6, pp. 4- 22 ,(1985) , 10.1016/0196-8858(85)90002-8
Byron K. Williams, Eleanor D. Brown, Adaptive management: from more talk to real action. Environmental Management. ,vol. 53, pp. 465- 479 ,(2014) , 10.1007/S00267-013-0205-7
Neha Gupta, Ole-Christoffer Granmo, Ashok Agrawala, Thompson Sampling for Dynamic Multi-armed Bandits international conference on machine learning and applications. ,vol. 1, pp. 484- 489 ,(2011) , 10.1109/ICMLA.2011.144
MICHAEL A. McCARTHY, HUGH P. POSSINGHAM, Active Adaptive Management for Conservation Conservation Biology. ,vol. 21, pp. 956- 963 ,(2007) , 10.1111/J.1523-1739.2007.00677.X
Byron K. Williams, Passive and active adaptive management: Approaches and an example Journal of Environmental Management. ,vol. 92, pp. 1371- 1378 ,(2011) , 10.1016/J.JENVMAN.2010.10.039