Active learning for misspecified generalized linear models

Abstract: Active learning refers to algorithmic frameworks aimed at selecting training data points in order to reduce the number of required training data points and/or improve the generalization performance of a learning method. In this paper, we present an asymptotic analysis of active learning for generalized linear models. Our analysis holds under the common practical situation of model misspecification, and is based on realistic assumptions regarding the nature of the sampling distributions, which are usually neither independent nor identical. We derive unbiased estimators of generalization performance, as well as estimators of expected reduction in generalization error after adding a new training data point, that allow us to optimize its sampling distribution through a convex optimization problem. Our analysis naturally leads to an algorithm for sequential active learning which is applicable to all tasks supported by generalized linear models (e.g., binary classification, multi-class classification, regression) and can be applied in non-linear settings through the use of Mercer kernels.
