Dynamic conditional random fields

作者: Charles Sutton , Khashayar Rohanimanesh , Andrew McCallum

DOI: 10.1145/1015330.1015422

关键词: Belief propagationPattern recognitionInferenceConditional random fieldBayesian networkVariable eliminationComputer scienceProbabilistic logicArtificial intelligenceDynamic Bayesian networkApproximate inferenceGraphical modelTraining set

摘要: In sequence modeling, we often wish to represent complex interaction between labels, such as when performing multiple, cascaded labeling tasks on the same sequence, or long-range dependencies exist. We present dynamic conditional random fields (DCRFs), a generalization of linear-chain (CRFs) in which each time slice contains set state variables and edges---a distributed representation Bayesian networks (DBNs)---and parameters are tied across slices. Since exact inference can be intractable models, perform approximate using several schedules for belief propagation, including tree-based reparameterization (TRP). On natural-language chunking task, show that DCRF performs better than series CRFs, achieving comparable performance only half training data.

参考文章(57)
Zoubin Ghahramani, Michael Jordan, None, Factorial Hidden Markov Models neural information processing systems. ,vol. 29, pp. 472- 478 ,(1995) , 10.1023/A:1007425814087
Thomas Minka, Divergence measures and message passing pp. 17- ,(2005)
Fuchun Peng, Andrew McCallum, Accurate Information Extraction from Research Papers using Conditional Random Fields north american chapter of the association for computational linguistics. pp. 329- 336 ,(2004)
Yair Weiss, Kevin P. Murphy, Michael I. Jordan, Loopy belief propagation for approximate inference: an empirical study uncertainty in artificial intelligence. pp. 467- 475 ,(1999)
Christopher D. Manning, Hinrich Schütze, Foundations of Statistical Natural Language Processing ,(1999)
Junhwan Kim, Ramin Zabih, Factorial Markov Random Fields european conference on computer vision. pp. 321- 334 ,(2002) , 10.1007/3-540-47977-5_21
Martin J. Wainwright, Alan S. Willsky, Tommi S. Jaakkola, Stochastic processes on graphs with cycles: geometric and variational approaches Massachusetts Institute of Technology. ,(2002)
L. A. Ramshaw, M. P. Marcus, Text Chunking Using Transformation-Based Learning meeting of the association for computational linguistics. pp. 157- 176 ,(1999) , 10.1007/978-94-017-2390-9_10
Mitch Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz, None, Building a large annotated corpus of English: the penn treebank Computational Linguistics. ,vol. 19, pp. 313- 330 ,(1993) , 10.21236/ADA273556