Subhojyoti Mukherjee

机构: University of Wisconsin Madison

主页: subhojyoti.github.io

每年引用次数

引用次数

引用: 170

H-指数: 7

I10-指数 : 7

出版物: 21

标题

引用次数

年份

Finite-time Analysis of Frequentist Strategies for Multi-armed Bandits

SUBHOJYOTI MUKHERJEE

2018

Latent Ranked Bandits

Subhojyoti Mukherjee , Anup B. Rao , Branislav Kveton

2019

A Unified Approach to Translate Classical Bandit Algorithms to Structured Bandits

Osman Yagan , Gauri Joshi , Subhojyoti Mukherjee , Samarth Gupta
international conference on acoustics speech and signal processing

2021

A Unified Approach to Translate Classical Bandit Algorithms to the Structured Bandit Setting

Samarth Gupta , Shreyas Chaudhari , Subhojyoti Mukherjee , Gauri Joshi
IEEE Journal on Selected Areas in Information Theory 1 ( 3) 840 -853

2020

Nearly optimal algorithms for level set estimation

Blake Mason , Romain Camilleri , Subhojyoti Mukherjee , Kevin Jamieson
arXiv preprint arXiv:2111.01768

2021

Chernoff sampling for active testing and extension to active regression

Subhojyoti Mukherjee , Ardhendu S Tripathy , Robert Nowak
International Conference on Artificial Intelligence and Statistics 7384 -7432

2022

Revar: Strengthening policy evaluation via reduced variance sampling

Subhojyoti Mukherjee , Josiah P Hanna , Robert D Nowak
Uncertainty in Artificial Intelligence 1413 -1422

2022

Speed: Experimental design for policy evaluation in linear heteroscedastic bandits

Subhojyoti Mukherjee , Qiaomin Xie , Josiah P Hanna , Robert Nowak
International Conference on Artificial Intelligence and Statistics 2962 -2970

2024

Multi-task Representation Learning for Pure Exploration in Bilinear Bandits

Subhojyoti Mukherjee , Qiaomin Xie , Josiah Hanna , Robert Nowak
Advances in Neural Information Processing Systems 36

2024

Efficient and Interpretable Bandit Algorithms

Subhojyoti Mukherjee , Ruihao Zhu , Branislav Kveton
arXiv preprint arXiv:2310.14751

2023

Experimental Design for Active Transductive Inference in Large Language Models

Subhojyoti Mukherjee , Ge Liu , Aniket Deshmukh , Anusha Lalitha
arXiv preprint arXiv:2404.08846

2024

Thresholding bandits with augmented ucb

Subhojyoti Mukherjee , Kolar Purushothama Naveen , Nandan Sudarsanam , Balaraman Ravindran
arXiv preprint arXiv:1704.02281

2017

Efficient-ucbv: An almost optimal algorithm using variance estimates

Subhojyoti Mukherjee , KP Naveen , Nandan Sudarsanam , Balaraman Ravindran
Proceedings of the AAAI Conference on Artificial Intelligence 32 ( 1)

2018

Distribution-dependent and time-uniform bounds for piecewise iid bandits

Subhojyoti Mukherjee , Odalric-Ambrym Maillard
arXiv preprint arXiv:1905.13159

2019

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning

Subhojyoti Mukherjee , Josiah P Hanna , Qiaomin Xie , Robert Nowak
arXiv preprint arXiv:2406.05064

2024

Off-Policy Evaluation from Logged Human Feedback

Aniruddha Bhargava , Lalit Jain , Branislav Kveton , Ge Liu
arXiv preprint arXiv:2406.10030

2024

Optimal Design for Human Preference Elicitation

Subhojyoti Mukherjee , Anusha Lalitha , Kousha Kalantari , Aniket Deshmukh
Advances in Neural Information Processing Systems 37

2024

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP

Subhojyoti Mukherjee , Josiah P Hanna , Robert Nowak
International Conference on Machine Learning (ICML 2024)

2024

Safety Aware Changepoint Detection for Piecewise iid Bandits

Subhojyoti Mukherjee
Proceedings of the Thirty-Eighth Conference on Uncertainty in Artificial Intelligence 180 1402 -1412

2022

ACTIVE SEQUENTIAL HYPOTHESIS TESTING WITH EXTENSION TO ACTIVE REGRESSION AND MULTI-ARMED BANDITS

Subhojyoti Mukherjee

2021

Multi-armed Bandits

Reinforcement Learning

Large Language Models

RLHF

Subhojyoti Mukherjee

引用次数

出版物: 21

我的账户