作者: S. Deligne , F. Bimbot
DOI: 10.1109/ICASSP.1995.479391
关键词:
摘要: The multigram model assumes that language can be described as the output of a memoryless source emits variable-length sequences words. estimation parameters formulated maximum likelihood problem from incomplete data. We show estimates computed through an iterative expectation-maximization algorithm and we describe forward-backward procedure for its implementation. report results systematical evaluation multigrams modeling on ATIS database. objective performance measure is test set perplexity. Our outperform conventional n-grams this task.