Bayesian methods for Support Vector machines and Gaussian processes

作者: Matthias Seeger

DOI:

关键词:

摘要: We present a common probabilistic framework for kernel or spline smooth- ing methods, including popular architectures such as Gaussian processes and Support Vector machines. identify the problem of unnormalized loss func- tions suggest general technique to overcome this at least ap- proximately. give an intuitive interpretation effect function can induce, by comparing classification (SVC) with process (GPC) nonparametric generalization logistic regression. This relates SVC boosting techniques. propose variational Bayesian model selection algorithm nor- malized functions. has wider applicability than other previously suggested techniques exhibits comparable perfor- mance in cases where both are applicable. discuss results substantial number experiments which we applied vari- ational real-world tasks compared it range known methods. The scope thesis is provide bridge between fields Statistical Learning Theory, some material tutorial nature hope will be useful researchers fields.

参考文章(75)
D. J. C. Mackay, Introduction to Gaussian processes NATO advanced study institute on generalization in neural networks and machine learning. pp. 133- 165 ,(1998)
James Tin-Yau Kwok, Integrating the evidence framework and the support vector machine the european symposium on artificial neural networks. pp. 177- 182 ,(1999)
David J. C. MacKay, Hyperparameters: Optimize, or Integrate Out? Maximum Entropy and Bayesian Methods. pp. 43- 59 ,(1996) , 10.1007/978-94-015-8729-7_2
Tommi S. Jaakkola, Michael I. Jordan, Variational methods for inference and estimation in graphical models Massachusetts Institute of Technology. ,(1997)
Christopher J. C. Burges, Geometry and invariance in kernel based methods Advances in kernel methods. pp. 89- 116 ,(1999)
John C. Platt, Fast training of support vector machines using sequential minimal optimization Advances in kernel methods. pp. 185- 208 ,(1999)
Umesh V. Vazirani, Michael J. Kearns, An Introduction to Computational Learning Theory ,(1994)
Peter McCullagh, John Ashworth Nelder, Generalized Linear Models ,(1983)
Christopher M. Bishop, Neural networks for pattern recognition ,(1995)