Identifying best interventions through online importance sampling

作者: Sanjay Shakkottai , Alexandres G. Dimakis , Rajat Sen , Karthikeyan Shanmugam

DOI:

关键词:

摘要: Motivated by applications in computational advertising and systems biology, we consider the problem of identifying best out several possible soft interventions at a source node V an acyclic causal directed graph, to maximize expected value target Y (located downstream V). Our setting imposes fixed total budget for sampling under various interventions, along with cost constraints on different types interventions. We pose this as arm identification bandit K arms where each is intervention V, leverage information leakage among provide first gap dependent error simple regret bounds problem. results are significant improvement over traditional results. empirically show that our algorithms outperform state art Flow Cytometry data-set, also apply algorithm model interpretation Inception-v3 deep net classifies images.

参考文章(34)
Matthias Krauledat, Klaus-Robert Müller, Masashi Sugiyama, Covariate Shift Adaptation by Importance Weighted Cross Validation Journal of Machine Learning Research. ,vol. 8, pp. 985- 1005 ,(2007)
Jerzy Splawa-Neyman, D. M. Dabrowska, T. P. Speed, On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 Statistical Science. ,vol. 5, pp. 465- 472 ,(1990) , 10.1214/SS/1177012031
Joseph M. Hilbe, James W. Hardin, Generalized linear models and extensions Stata Press. ,(2007)
Olivier Cappé, Aurélien Garivier, Emilie Kaufmann, On the complexity of best-arm identification in multi-armed bandit models Journal of Machine Learning Research. ,vol. 17, pp. 1- 42 ,(2016)
Jean-Yves Audibert, Sébastien Bubeck, Rémi Munos, Best Arm Identification in Multi-Armed Bandits conference on learning theory. pp. 41- 53 ,(2010)
Aleksandrs Slivkins, Dynamic Ad Allocation: Bandits with Budgets. arXiv: Learning. ,(2013)
Peilin Zhao, Peilin Zhao, Peilin Zhao, Tong Zhang, Tong Zhang, Stochastic Optimization with Importance Sampling for Regularized Loss Minimization international conference on machine learning. ,vol. 1, pp. 1- 9 ,(2015)
Dominik Janzing, Bernhard Schölkopf, Jakob Zscheischler, Joris M. Mooij, Jonas Peters, Distinguishing cause from effect using observational data: methods and benchmarks Journal of Machine Learning Research. ,vol. 17, pp. 1103- 1204 ,(2016)
Tom Heskes, Joris M. Mooij, Cyclic causal discovery from continuous equilibrium data uncertainty in artificial intelligence. pp. 431- 439 ,(2013)
Alexandros G. Dimakis, Sriram Vishwanath, Karthikeyan Shanmugam, Murat Kocaoglu, Learning Causal Graphs with Small Interventions arXiv: Artificial Intelligence. ,(2015)