Spectral estimation of hidden Markov models

作者: Jordan Rodu

DOI:

关键词:

摘要: This thesis extends and improves methods for estimating key quantities of hidden Markov models through spectral method-of-moments estimation. Unlike traditional estimation like EM Gibbs sampling, the set methods, which we call HMMs (sHMMs), are incredibly fast, do not require multiple restarts, come with provable guarantees. Our first result upon original algorithm by parameters from fully reduced data. We also show that developed in dimensional version can be estimated using various forms regression, lead to major speed gains, as well allowing flexibility scheme. then extend beyond basic latent variable tree structures have linguistic applications, especially dependency parsing, finally output is a high-dimensional, continuously distributed variable. factored into two componentsestimation state space dynamics, observation probability distributions. leads extremely flexible procedures tailored precisely task interest. These tools all simple implement, naturally incorporate dimension reduction, allows them scale gracefully data increases. Degree Type Dissertation Name Doctor Philosophy (PhD) Graduate Group Statistics First Advisor Dean Foster Subject Categories Probability dissertation available at ScholarlyCommons: http://repository.upenn.edu/edissertations/1423 SPECTRAL ESTIMATION OF HIDDEN MARKOV MODELS

参考文章(46)
Fernando Pereira, Ryan Mcdonald, Discriminative learning and spanning tree algorithms for dependency parsing University of Pennsylvania. ,(2006)
Dean P. Foster, Lyle H. Ungar, Jordan Rodu, Spectral dimensionality reduction for HMMs arXiv: Machine Learning. ,(2012)
Sham M. Kakade, Dean P. Foster, Multi-view regression via canonical correlation analysis conference on learning theory. pp. 82- 96 ,(2007) , 10.1007/978-3-540-72927-3_8
Sebastiaan A. Terwijn, On the Learnability of Hidden Markov Models international colloquium on grammatical inference. pp. 261- 268 ,(2002) , 10.1007/3-540-45790-9_21
Jason Eisner, Bilexical Grammars and their Cubic-Time Parsing Algorithms Springer Netherlands. pp. 29- 61 ,(2000) , 10.1007/978-94-015-9470-7_3
Mitch Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz, None, Building a large annotated corpus of English: the penn treebank Computational Linguistics. ,vol. 19, pp. 313- 330 ,(1993) , 10.21236/ADA273556
Sebastian Thrun, Wolfram Burgard, Dieter Fox, A Probabilistic Approach to Concurrent Mapping and Localization for Mobile Robots Machine Learning. ,vol. 31, pp. 253- 271 ,(1998) , 10.1023/A:1007436523611
Adwait Ratnaparkhi, A Maximum Entropy Model for Part-Of-Speech Tagging empirical methods in natural language processing. ,(1996)
Denis Turdakov, Dmitry Lizorkin, HMM Expanded to Multiple Interleaved Chains as a Model for Word Sense Disambiguation pacific asia conference on language information and computation. pp. 549- 558 ,(2009)
M.P. Schützenberger, On the definition of a family of automata Information & Computation. ,vol. 4, pp. 245- 270 ,(1961) , 10.1016/S0019-9958(61)80020-X