Shift-pessimistic active learning using robust bias-aware prediction

作者: Brian D. Ziebart , Anqi Liu , Lev Reyzin

DOI:

关键词: Semi-supervised learningPessimismComputer scienceBinary classificationActive learningMachine learningMulti-task learningGeneralization errorArtificial intelligenceProbabilistic logic

摘要: Existing approaches to active learning are generally optimistic about their certainty with respect data shift between labeled and unlabeled data. They assume that unknown datapoint labels follow the inductive biases of learner. As a result, most useful data-point labels—ones refute current biases— rarely solicited. We propose shift-pessimistic approach assumes worst-case conditional label distribution. This closely aligns model uncertainty generalization error, enabling more solicitation. investigate theoretical benefits this demonstrate its empirical advantages on probabilistic binary classification tasks.

参考文章(30)
David J. C. MacKay, The evidence framework applied to classification networks Neural Computation. ,vol. 4, pp. 720- 736 ,(1992) , 10.1162/NECO.1992.4.5.720
Josh Attenberg, Foster Provost, Inactive learning? ACM SIGKDD Explorations Newsletter. ,vol. 12, pp. 36- 41 ,(2011) , 10.1145/1964897.1964906
Masashi Sugiyama, Shinichi Nakajima, Hisashi Kashima, Paul von Bünau, Motoaki Kawanabe, Direct Importance Estimation with Model Selection and Its Application to Covariate Shift Adaptation neural information processing systems. ,vol. 20, pp. 1433- 1440 ,(2007)
Brian Ziebart, Anqi Liu, Robust Classification Under Sample Selection Bias neural information processing systems. ,vol. 27, pp. 37- 45 ,(2014)
Francis R. Bach, Active learning for misspecified generalized linear models neural information processing systems. pp. 65- 72 ,(2006)
Yao-liang Yu, Csaba Szepesv ri, Analysis of Kernel Mean Matching under Covariate Shift international conference on machine learning. pp. 1147- 1154 ,(2012)
Yishay Mansour, Corinna Cortes, Mehryar Mohri, Learning Bounds for Importance Weighting neural information processing systems. ,vol. 23, pp. 442- 450 ,(2010)
Bernhard Schölkopf, Alex J. Smola, Karsten M. Borgwardt, Jiayuan Huang, Arthur Gretton, Correcting Sample Selection Bias by Unlabeled Data neural information processing systems. ,vol. 19, pp. 601- 608 ,(2006)
Flemming Topsøe, Information-theoretical optimization techniques Kybernetika. ,vol. 15, pp. 8- 27 ,(1979)
Masashi Sugiyama, Active Learning for Misspecified Models neural information processing systems. ,vol. 18, pp. 1305- 1312 ,(2005)