Max-margin Classification of Data with Absent Features

作者: Pieter Abbeel , Gal Elidan , Geremy Heitz , Daphne Koller , Gal Chechik

DOI:

关键词:

摘要: We consider the problem of learning classifiers in structured domains, where some objects have a subset features that are inherently absent due to complex relationships between features. Unlike case feature exists but its value is not observed, here we focus on may even exist (structurally absent) for samples. The common approach handling missing discriminative models first complete their unknown values, and then use standard classification procedure over completed data. This paper focuses known be non-existing, rather than an value. show how incomplete data can classified directly without any completion using max-margin framework. formulate objective function, based geometric interpretation margin, aims maximize margin each sample own relevant subspace. In this formulation, linearly separable transformed into binary search series second order cone programs (SOCP), convex solved efficiently. also describe two approaches optimizing general case: approximation as quadratic program (QP) iterative solving exact problem. By avoiding pre-processing phase which completed, both these could offer considerable computational savings. More importantly, elegant values by our allows it outperform other methods when non-trivial structure, competitive with at random. demonstrate results several benchmarks real-world problems: edge prediction metabolic pathways, automobile detection natural images.

参考文章(50)
Yee Whye Teh, Dilan Grür, Zoubin Ghahramani, None, Stick-breaking Construction for the Indian Buffet Process international conference on artificial intelligence and statistics. pp. 556- 563 ,(2007)
Ariel Jaimovich, Gal Elidan, Hanah Margalit, Nir Friedman, Towards an integrated protein-protein interaction network research in computational molecular biology. pp. 14- 30 ,(2005) , 10.1007/11415770_2
Wray Buntine, Theory refinement on Bayesian networks uncertainty in artificial intelligence. pp. 52- 60 ,(1991) , 10.1016/B978-1-55860-203-8.50010-3
Bernhard Schölkopf, Alexander J. Smola, Learning with Kernels The MIT Press. pp. 626- ,(2018) , 10.7551/MITPRESS/4175.001.0001
Nir Friedman, The Bayesian structural EM algorithm uncertainty in artificial intelligence. pp. 129- 138 ,(1998)
Rosalind W. Picard, Ashish Kapoor, Learning discriminative models with incomplete data Massachusetts Institute of Technology. ,(2006)
David Jensen, Jennifer Neville, Linkage and Autocorrelation Cause Feature Selection Bias in Relational Learning international conference on machine learning. pp. 259- 266 ,(2002)
Christopher K I Williams, Carl Edward Rasmussen, Gaussian Processes for Machine Learning ,(2005)