作者: Ruiyi Zhang , Zheng Wen , Changyou Chen , Chen Fang , Tong Yu
DOI:
关键词:
摘要: Thompson sampling (TS) is a class of algorithms for sequential decision-making, which requires maintaining a posterior distribution over a model. However, calculating exact …