Latent Ranked Bandits

Subhojyoti Mukherjee, Anup B. Rao, Branislav Kveton

2019
A Unified Approach to Translate Classical Bandit Algorithms to Structured Bandits

Osman Yagan, Gauri Joshi, Subhojyoti Mukherjee, Samarth Gupta
International Conference on Acoustics, Speech and Signal Processing

2021
A Unified Approach to Translate Classical Bandit Algorithms to the Structured Bandit Setting

Samarth Gupta, Shreyas Chaudhari, Subhojyoti Mukherjee, Gauri Joshi
IEEE Journal on Selected Areas in Information Theory 1 (3) 840-853

2
2020
Nearly optimal algorithms for level set estimation

Blake Mason, Romain Camilleri, Subhojyoti Mukherjee, Kevin Jamieson
arXiv preprint arXiv:2111.01768

14
2021
Chernoff sampling for active testing and extension to active regression

Subhojyoti Mukherjee, Ardhendu S Tripathy, Robert Nowak
International Conference on Artificial Intelligence and Statistics 7384-7432

9
2022
ReVar: Strengthening policy evaluation via reduced variance sampling

Subhojyoti Mukherjee, Josiah P Hanna, Robert D Nowak
Uncertainty in Artificial Intelligence 1413-1422

8
2022
SPEED: Experimental design for policy evaluation in linear heteroscedastic bandits

Subhojyoti Mukherjee, Qiaomin Xie, Josiah P Hanna, Robert Nowak
International Conference on Artificial Intelligence and Statistics 2962-2970

6
2024
Multi-task Representation Learning for Pure Exploration in Bilinear Bandits

Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, Robert Nowak
Advances in Neural Information Processing Systems 36

1
2024
Efficient and Interpretable Bandit Algorithms

Subhojyoti Mukherjee, Ruihao Zhu, Branislav Kveton
arXiv preprint arXiv:2310.14751

1
2023
Experimental Design for Active Transductive Inference in Large Language Models

Subhojyoti Mukherjee, Ge Liu, Aniket Deshmukh, Anusha Lalitha
arXiv preprint arXiv:2404.08846

2024
Thresholding bandits with augmented UCB

Subhojyoti Mukherjee, Kolar Purushothama Naveen, Nandan Sudarsanam, Balaraman Ravindran
arXiv preprint arXiv:1704.02281

31
2017
Efficient-UCBV: An almost optimal algorithm using variance estimates

Subhojyoti Mukherjee, KP Naveen, Nandan Sudarsanam, Balaraman Ravindran
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1)

14
2018
Distribution-dependent and time-uniform bounds for piecewise iid bandits

Subhojyoti Mukherjee, Odalric-Ambrym Maillard
arXiv preprint arXiv:1905.13159

11
2019
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning

Subhojyoti Mukherjee, Josiah P Hanna, Qiaomin Xie, Robert Nowak
arXiv preprint arXiv:2406.05064

2024
Off-Policy Evaluation from Logged Human Feedback

Aniruddha Bhargava, Lalit Jain, Branislav Kveton, Ge Liu
arXiv preprint arXiv:2406.10030

2024
Optimal Design for Human Preference Elicitation

Subhojyoti Mukherjee, Anusha Lalitha, Kousha Kalantari, Aniket Deshmukh
Advances in Neural Information Processing Systems 37

3
2024
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP

Subhojyoti Mukherjee, Josiah P Hanna, Robert Nowak
International Conference on Machine Learning (ICML 2024)

1
2024
Safety Aware Changepoint Detection for Piecewise iid Bandits

Subhojyoti Mukherjee
Proceedings of the Thirty-Eighth Conference on Uncertainty in Artificial Intelligence 180 1402-1412

1
2022