Learning the Kernel with Hyperkernels

作者: Cheng Soon Ong , Alexander Smola , Robert Williamson

DOI: 10.5555/1046920.1088708

关键词: Reproducing kernel Hilbert spaceMathematical optimizationKernel methodTree kernelKernel embedding of distributionsRadial basis function kernelMathematicsPolynomial kernelRepresenter theoremKernel (statistics)

摘要: This paper addresses the problem of choosing a kernel suitable for estimation with support vector machine, hence further automating machine learning. goal is achieved by defining reproducing Hilbert space on kernels itself. Such formulation leads to statistical similar minimizing regularized risk functional.We state equivalent representer theorem choice and present semidefinite programming resulting optimization problem. Several recipes constructing hyperkernels are provided, as well details common learning problems. Experimental results classification, regression novelty detection UCI data show feasibility our approach.

参考文章(41)
V. N. Vapnik, The Nature of Statistical Learning Theory. ,(1995)
Tong Zhang, Some Sparse Approximation Bounds for Regression Problems international conference on machine learning. pp. 624- 631 ,(2001)
G. Rätsch, T. Onoda, K.-R. Müller, Soft Margins for AdaBoost Machine Learning. ,vol. 42, pp. 287- 320 ,(2001) , 10.1023/A:1007618119488
Christopher M. Bishop, Neural networks for pattern recognition ,(1995)
Peter Bartlett, John Shawe-Taylor, Generalization performance of support vector machines and other pattern classifiers Advances in kernel methods. pp. 43- 54 ,(1999)
Gert R. G. Lanckriet, Peter L. Bartlett, Laurent El Ghaoui, Nello Christianini, Michael I. Jordan, Learning the Kernel Matrix with Semi-Definite Programming international conference on machine learning. pp. 323- 330 ,(2002)
Jos F. Sturm, Using SeDuMi 1.02, a MATLAB toolbox for optimization over symmetric cones Optimization Methods & Software. ,vol. 11, pp. 625- 653 ,(1999) , 10.1080/10556789908805766
Dennis D. Cox, Finbarr O'Sullivan, Asymptotic Analysis of Penalized Likelihood and Related Estimators Annals of Statistics. ,vol. 18, pp. 1676- 1695 ,(1990) , 10.1214/AOS/1176347872
N. Aronszajn, Theory of Reproducing Kernels. Transactions of the American Mathematical Society. ,vol. 68, pp. 337- 404 ,(1950) , 10.1090/S0002-9947-1950-0051437-7