A decision-theoretic extension of stochastic complexity and its applications to learning

Author: K. Yamanishi

DOI: 10.1109/18.681319

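Abstract: Rissanen (1978) has introduced stochastic complexity to define the amount of information in a given data sequence relative to a given hypothesis class of probability densities, where the information is measured in terms of the logarithmic loss associated with universal data compression. This paper introduces the notion of extended stochastic complexity (ESC) and demonstrates its effectiveness in the design and analysis of learning algorithms in on-line prediction and batch-learning scenarios. ESC can be thought of as an extension of Rissanen's stochastic complexity to the decision-theoretic setting, where a general real-valued function is used as a hypothesis and a general loss function is used as a distortion measure. As an application of ESC to on-line prediction, this paper shows that a sequential realization of ESC produces an on-line prediction algorithm called Vovk's aggregating strategy, which can be thought of as an extension of the Bayes algorithm. We derive upper bounds on the cumulative loss for the aggregating strategy, both of an expected form and of a worst-case form, in the case where the hypothesis class is continuous. As an application of ESC to batch-learning, this paper shows that a batch-approximation of ESC induces a batch-learning algorithm called the minimum L-complexity algorithm (MLC), which is an extension of the minimum description length (MDL) principle. We derive upper bounds on the statistical risk for MLC, which are least to date. Through ESC we give a unifying view of the most effective learning algorithms that have been explored in computational learning theory.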
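As a rough illustration of the quantity the abstract describes, the sketch below computes an ESC-style value for a finite hypothesis class, using the mixture form -(1/λ) ln Σ_k π_k exp(-λ · cumulative loss of hypothesis k). The finite class, the function name `extended_stochastic_complexity`, and its parameters are assumptions made for this example and are not taken from the record; the paper itself also treats continuous classes. With logarithmic loss and λ = 1, this expression reduces to the negative log of a Bayes mixture probability.

```python
import math

def extended_stochastic_complexity(losses_per_hypothesis, prior, lam):
    """Hypothetical helper: -(1/lam) * ln sum_k prior[k] * exp(-lam * L_k),
    where L_k is the cumulative loss of hypothesis k on the data sequence."""
    # Work in log space (log-sum-exp) so large cumulative losses do not underflow.
    terms = [
        math.log(p) - lam * sum(losses)
        for p, losses in zip(prior, losses_per_hypothesis)
        if p > 0
    ]
    m = max(terms)
    return -(m + math.log(sum(math.exp(t - m) for t in terms))) / lam

# Example: two Bernoulli hypotheses (p = 0.5 and p = 0.9) under logarithmic
# loss on the binary sequence 1, 1, 0, with a uniform prior and lam = 1.
log_losses = [
    [-math.log(0.5)] * 3,
    [-math.log(0.9), -math.log(0.9), -math.log(0.1)],
]
esc = extended_stochastic_complexity(log_losses, [0.5, 0.5], 1.0)
```

Under a uniform prior over K hypotheses, this value never exceeds the smallest cumulative loss in the class by more than (ln K)/λ, which is the kind of relative loss bound the abstract refers to.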

References (27)
David Haussler, Jyrki Kivinen, Manfred K. Warmuth. Tight worst-case loss bounds for predicting with expert advice. European Conference on Computational Learning Theory, pp. 69-83 (1994). DOI: 10.1007/3-540-59119-2_169
Jyrki Kivinen, Manfred K. Warmuth. Using experts for predicting continuous outcomes. European Conference on Computational Learning Theory, pp. 109-120 (1994).
Andrew R. Barron. Complexity Regularization with Application to Artificial Neural Networks. Nonparametric Functional Estimation and Related Topics, pp. 561-576 (1991). DOI: 10.1007/978-94-011-3222-0_42
Jorma Rissanen. Stochastic Complexity and Modeling. Annals of Statistics, vol. 14, pp. 1080-1100 (1986). DOI: 10.1214/AOS/1176350051
Yoav Freund. Predicting a binary sequence almost as well as the optimal biased coin. Proceedings of the Ninth Annual Conference on Computational Learning Theory (COLT '96), pp. 89-98 (1996). DOI: 10.1145/238061.238072
Nicolò Cesa-Bianchi, Yoav Freund, David Haussler, David P. Helmbold, Robert E. Schapire, Manfred K. Warmuth. How to use expert advice. Journal of the ACM, vol. 44, pp. 427-485 (1997). DOI: 10.1145/258128.258179
Kenji Yamanishi. Probably Almost Discriminative Learning. Machine Learning, vol. 18, pp. 23-50 (1995). DOI: 10.1023/A:1022870506888
C. E. Shannon, Warren Weaver. A mathematical theory of communication. Bell System Technical Journal, vol. 27, pp. 379-423 (1948). DOI: 10.1002/J.1538-7305.1948.TB01338.X
Kenji Yamanishi. On-Line Maximum Likelihood Prediction with Respect to General Loss Functions. Journal of Computer and System Sciences, vol. 55, pp. 105-118 (1997). DOI: 10.1006/JCSS.1997.1503