Convergence rates of active learning for maximum likelihood estimation

作者: Kamalika Chaudhuri , Sham M Kakade , Praneeth Netrapalli , Sujay Sanghavi , None

DOI:

关键词:

摘要: An active learner is given a class of models, large set unlabeled examples, and the ability to interactively query labels subset these examples; goal learn model in that fits data well. Previous theoretical work has rigorously characterized label complexity learning, but most this focused on PAC or agnostic model. In paper, we shift our attention more general setting - maximum likelihood estimation. Provided certain conditions hold class, provide two-stage learning algorithm for problem. The require are fairly general, cover widely popular Generalized Linear Models, which turn, include models binary multi-class classification, regression, conditional random fields. We an upper bound requirement algorithm, lower matches it up order terms. Our analysis shows unlike classification realizable case, just single extra round interaction sufficient achieve near-optimal performance On empirical side, recent [12] [13] (on linear logistic regression) promise approach.

参考文章(21)
Ruth Urner, Sharon Wulff, Shai Ben-David, PLAL: Cluster-based active learning conference on learning theory. pp. 376- 397 ,(2013)
Lucien M Le Cam, Asymptotic methods in statistical theory Springer-Verlag New York, Inc.. ,(1986)
Matti Kääriäinen, Active learning in the non-realizable case algorithmic learning theory. pp. 63- 77 ,(2006) , 10.1007/11894841_9
Alina Beygelzimer, John Langford, Daniel J. Hsu, Zhang Tong, Agnostic Active Learning Without Constraints neural information processing systems. ,vol. 23, pp. 199- 207 ,(2010)
Robert D. Nowak, The Geometry of Generalized Binary Search IEEE Transactions on Information Theory. ,vol. 57, pp. 7893- 7906 ,(2011) , 10.1109/TIT.2011.2169298
Sanjoy Dasgupta, Two faces of active learning Theoretical Computer Science. ,vol. 412, pp. 1767- 1781 ,(2011) , 10.1016/J.TCS.2010.12.054
Sanjoy Dasgupta, Daniel Hsu, Hierarchical sampling for active learning Proceedings of the 25th international conference on Machine learning - ICML '08. pp. 208- 215 ,(2008) , 10.1145/1390156.1390183
Maria-Florina Balcan, Alina Beygelzimer, John Langford, Agnostic active learning Journal of Computer and System Sciences. ,vol. 75, pp. 78- 89 ,(2009) , 10.1016/J.JCSS.2008.07.003