Inference with classifiers: a study of structured output problems in natural language processing

作者: Vasin Punyakanok , Dan Roth

DOI:

关键词: Machine learningTask (project management)Process (engineering)Semantic role labelingNatural languageShallow parsingArtificial intelligenceComputer scienceInferenceMetric (mathematics)Natural language processingNatural approach

摘要: A large number of problems in natural language processing (NLP) involve outputs with complex structure. Conceptually such problems, the task is to assign values multiple variables which represent several interdependent components. approach this formulate it as a two-stage process. In first stage, are assigned initial using machine learning based programs. second, an inference procedure uses outcomes stage classifiers along domain specific constraints order infer globally consistent final prediction. This dissertation introduces framework, classifiers, study problems. The framework applied two important and fundamental NLP that structured outputs, shallow parsing semantic role labeling. parsing, goal identify syntactic phrases sentences, has been found useful variety large-scale applications. Semantic labeling identifying predicate-argument structure crucial step toward deeper understanding language. both tasks, we develop state-of-the-art systems have used practice. In shown significance incorporating into way correct improve decisions stand alone classifiers. Although clear necessarily improves global coherency, there no guarantee improvement performance measured terms accuracy local predictions---the metric interest for most We better theoretic issue. Under reasonable assumption, prove sufficient condition cannot degrade respect Hamming loss. addition, provide experimental suggesting can even when conditions not fully satisfied.

参考文章(80)
Kingsbury Paul, Palmer Martha, None, From treebank to propbank language resources and evaluation. ,(2002)
Nianwen Xue, Martha Palmer, None, Calibrating Features for Semantic Role Labeling empirical methods in natural language processing. pp. 88- 94 ,(2004)
Rodrigo de Salvo Braz, Roxana Girju, Vasin Punyakanok, Dan Roth, Mark Sammons, An inference model for semantic entailment in natural language national conference on artificial intelligence. pp. 1043- 1049 ,(2005) , 10.1007/11736790_15
Vasin Punyakanok, Wen-Tau Yih, Dan Roth, Cecilia Ovesdotter Alm, Ramya Nagarajan, Liam Gerard Moran, Gio Kao Kao, Nick Rizzolo, Xin Li, Learning Components for A Question-Answering System. text retrieval conference. pp. 539- 548 ,(2001)
Sham Kakade, Yee Whye Teh, Sam T. Roweis, An Alternate Objective Function for Markovian Fields international conference on machine learning. pp. 275- 282 ,(2002)
Ralf Herbrich, Hugo Zaragoza, Yaoyong Li, John Shawe-Taylor, Jaz S. Kandola, The Perceptron Algorithm with Uneven Margins international conference on machine learning. pp. 379- 386 ,(2002)
Vasin Punyakanok, Dav Zimak, Dan Roth, Marcia Munoz, A Learning Approach to Shallow Parsing empirical methods in natural language processing. ,(1999)
Vasin Punyakanok, Wen-tau Yih, Dan Roth, The necessity of syntactic parsing for semantic role labeling international joint conference on artificial intelligence. pp. 1117- 1123 ,(2005)
Daniel Jurafsky, James H. Martin, Sameer Pradhan, Kadri Hacioglu, Wayne H. Ward, Semantic Role Labeling by Tagging Syntactic Chunks conference on computational natural language learning. pp. 110- 113 ,(2004)