A learning criterion for stochastic rules

作者: Kenji Yamanishi

DOI: 10.5555/92571.92590

关键词:

摘要: This paper proposes a learning criterion for stochastic rules. is developed by extending Valiant's PAC (Probably Approximately Correct) model, which deterministic Stochastic rules here refer to those probabilistically asign number of classes, lYr, each attribute vector X. The proposed based on the idea that may be regarded as probably approximately correct identification conditional probability distributions over classes given input vectors. An algorithm (an MDL algorithm) (Minimum Description Length) principle used Specifically, with finite partitioning (each specified disjoint cells domain and parameter associated them), this derives target-dependent upper bounds worst-case sample size required learn accuracy confidence. Based these bounds, proves polynomial-sample-size learnability decision lists (which are newly in analogue Rivest's lists) at most k literals (k fixed) decision, trees (a trees) depth. Sufficient conditions polynomial-time any also derived.

参考文章(36)
Robert E. Schapire, Michael J. Kearns, Efficient Distribution-free Learning of Probabilistic Concepts (Extended Abstract) foundations of computer science. pp. 382- 391 ,(1990)
Edwin P. D. Pednault, Some experiments in applying inductive inference principles to surface reconstruction international joint conference on artificial intelligence. pp. 1603- 1609 ,(1989)
Nicolò Cesa-Bianchi, Learning the Distribution in the Extended PAC Model. algorithmic learning theory. pp. 236- 246 ,(1990)
David Haussler, Decision Theoretic Generalizations of the PAC Learning Model. algorithmic learning theory. pp. 21- 41 ,(1990)
Charles H. Kraft, Some conditions for consistency and uniform consistency of statistical procedures University of California Press. ,(1955)
Leon Gordon Kraft, A device for quantizing, grouping, and coding amplitude-modulated pulses Massachusetts Institute of Technology. ,(1949)
Jorma Rissanen, Stochastic Complexity and Modeling Annals of Statistics. ,vol. 14, pp. 1080- 1100 ,(1986) , 10.1214/AOS/1176350051
S. Kullback, A lower bound for discrimination information in terms of variation (Corresp.) IEEE Transactions on Information Theory. ,vol. 13, pp. 126- 127 ,(1967) , 10.1109/TIT.1967.1053968