Multi-task active learning with output constraints

作者: Yi Zhang

DOI:

关键词:

摘要: Many problems in information extraction, text mining, natural language processing and other fields exhibit the same property: multiple prediction tasks are related sense that their outputs (labels) satisfy certain constraints. In this paper, we propose an active learning framework exploiting such relations among tasks. Intuitively, with task coupled by constraints, can utilize not only uncertainty of a single but also inconsistency predictions across We formalize idea as cross-task value criteria, which reward labeling assignment is propagated measured over all relevant reachable through A specific example our leads to cross entropy measure on tasks, generalizes classical single-task uncertain sampling. conduct experiments two real-world problems: web extraction document classification. Empirical results demonstrate effectiveness actively collecting labeled examples for

参考文章(14)
Russ Greiner, Yuhong Guo, Optimistic active learning using mutual information international joint conference on artificial intelligence. pp. 823- 829 ,(2007)
Eric Horvitz, Sumit Basu, Ashish Kapoor, Selective supervision: guiding supervised learning with decision-theoretic active learning international joint conference on artificial intelligence. pp. 877- 882 ,(2007)
Nicholas Roy, Andrew McCallum, Toward Optimal Active Learning through Sampling Estimation of Error Reduction international conference on machine learning. pp. 441- 448 ,(2001)
Kamal Nigam, Andrew McCallum, A comparison of event models for naive bayes text classification national conference on artificial intelligence. pp. 41- 48 ,(1998)
Guo-Jun Qi, Xian-Sheng Hua, Yong Rui, Jinhui Tang, Hong-Jiang Zhang, Two-Dimensional Active Learning for image classification computer vision and pattern recognition. pp. 1- 8 ,(2008) , 10.1109/CVPR.2008.4587383
Lev Ratinov, Dan Roth, Ming-Wei Chang, Guiding Semi-Supervision with Constraint-Driven Learning meeting of the association for computational linguistics. pp. 280- 287 ,(2007)
Lev Ratinov, Dan Roth, Ming-Wei Chang, Nicholas Rizzolo, Learning and inference with constraints national conference on artificial intelligence. pp. 1513- 1518 ,(2008)
Katrin Tomanek, Roi Reichart, Ari Rappoport, Udo Hahn, Multi-Task Active Learning for Linguistic Annotations meeting of the association for computational linguistics. pp. 861- 869 ,(2008)
Andrew Carlson, Justin Betteridge, Richard C Wang, Estevam R Hruschka Jr, Tom M Mitchell, None, Coupled semi-supervised learning for information extraction web search and data mining. pp. 101- 110 ,(2010) , 10.1145/1718487.1718501
Yiming Yang, Fan Li, David D. Lewis, Tony G. Rose, RCV1: A New Benchmark Collection for Text Categorization Research Journal of Machine Learning Research. ,vol. 5, pp. 361- 397 ,(2004) , 10.5555/1005332.1005345