Authors: Eiji Takimoto, Manfred K. Warmuth
Keywords:
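Abstract: We consider on-line density estimation with a parameterized density from an exponential family. In each trial t the learner predicts a parameter θ_t. Then it receives an instance x_t chosen by the adversary and incurs loss −ln p(x_t | θ_t), which is the negative log-likelihood of x_t w.r.t. the predicted density of the learner. The performance of the learner is measured by the regret, defined as the total loss of the learner minus the total loss of the best parameter chosen off-line. We develop an algorithm called the Last-step Minimax Algorithm that predicts with the minimax optimal parameter assuming that the current trial is the last one. For one-dimensional exponential families, we give an explicit form of the prediction of the Last-step Minimax Algorithm and show that its regret is O(ln T), where T is the number of trials. In particular, for Bernoulli density estimation the Last-step Minimax Algorithm is slightly better than the standard Krichevsky-Trofimov probability estimator.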
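The regret defined in the abstract can be illustrated for the Bernoulli case with a minimal Python sketch. The function names (`kt_predict`, `last_step_predict`, `regret`) are hypothetical and not taken from the paper. The Krichevsky-Trofimov rule is the standard (ones + 1/2) / (seen + 1) estimate. The last-step rule below is derived here only from the abstract's description (choose the prediction that equalizes the regret of the two possible outcomes if the current trial were the last one), so it is an assumption and may differ in form from the paper's own expression.

```python
import math


def xlogx(n):
    # n * ln(n), with the convention 0 * ln(0) = 0.
    return n * math.log(n) if n > 0 else 0.0


def kt_predict(ones, seen):
    # Krichevsky-Trofimov estimate of P(x = 1) after `seen` bits, `ones` of them 1.
    return (ones + 0.5) / (seen + 1.0)


def last_step_predict(ones, seen):
    # Assumed last-step rule: pick theta so that the regret is the same
    # whether the next (assumed final) bit is 1 or 0.
    t = seen + 1
    a1 = xlogx(ones + 1) + xlogx(seen - ones)  # counts if the next bit is 1
    a0 = xlogx(ones) + xlogx(t - ones)         # counts if the next bit is 0
    # theta = e^{a1} / (e^{a1} + e^{a0}), written in a numerically stable form.
    return 1.0 / (1.0 + math.exp(a0 - a1))


def regret(bits, predict):
    # Total log loss of the on-line predictor minus the total log loss
    # of the best fixed Bernoulli parameter chosen off-line.
    ones = 0
    total = 0.0
    for seen, x in enumerate(bits):
        theta = predict(ones, seen)
        total += -math.log(theta if x == 1 else 1.0 - theta)
        ones += x
    n = len(bits)
    best = -(xlogx(ones) + xlogx(n - ones) - xlogx(n))
    return total - best


if __name__ == "__main__":
    bits = [1, 0, 1, 1, 0, 1, 1, 1]
    print("KT regret:       ", regret(bits, kt_predict))
    print("last-step regret:", regret(bits, last_step_predict))
```

Both predictors start at 1/2 on the first trial. Running `regret` on sequences of increasing length gives an informal way to observe the logarithmic growth stated in the abstract.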