MaxGap Bandit: Adaptive Algorithms for Approximate Ranking

作者： Robert D. Nowak , Ardhendu Tripathy , Sumeet Katariya

DOI:

关键词: Key (cryptography) 、 Sorting 、 Ranking 、 Computer science 、 Sampling (statistics) 、 Rank (linear algebra) 、 Anomaly detection 、 Identification (information) 、 Algorithm 、 Minimax

摘要: This paper studies the problem of adaptively sampling from K distributions (arms) in order to identify largest gap between any two adjacent means. We call this MaxGap-bandit problem. arises naturally approximate ranking, noisy sorting, outlier detection, and top-arm identification bandits. The key novelty MaxGap bandit is that it aims determine natural partitioning into a subset with larger means smaller means, where split determined by rather than pre-specified rank or threshold. Estimating an arm’s requires its neighboring arms addition itself, dependence results novel hardness parameter characterizes sample complexity propose elimination UCB-style algorithms show they are minimax optimal. Our experiments require 6-8x fewer samples non-adaptive achieve same error.

参考文章(0)

MaxGap Bandit: Adaptive Algorithms for Approximate Ranking

来源期刊

我的账户

MaxGap Bandit: Adaptive Algorithms for Approximate Ranking

来源期刊

相似文章 1

Robust Outlier Arm Identification.

我的账户