Authors: YeYi Wang, Alejandro Acero
DOI:
Keywords:
Abstract: A rule-based grammar is generated. Segmentation ambiguities are identified in training data. Rewrite rules for the ambiguous segmentations are enumerated, and a probability is generated for each. The ambiguities are resolved based on these probabilities. In one embodiment, this is done by applying the expectation maximization (EM) algorithm.
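The abstract gives no implementation details, so the following is only a minimal sketch of how EM could assign probabilities to enumerated rewrite rules and then resolve an ambiguous segmentation. The rule names (`Cmd`, `Pre`), the toy training data, the uniform initialization, and the function names `em_rule_probs` and `resolve` are all assumptions made for illustration and are not taken from the source.

```python
from collections import defaultdict
from math import prod

# Hypothetical toy data: each training example is a list of candidate
# segmentations, and each segmentation is a list of rewrite rules (lhs, rhs).
# In the first example the word "from" may attach to either rule (ambiguous);
# the second example has only one possible segmentation.
EXAMPLES = [
    [
        [("Cmd", "show flights"), ("Pre", "from")],
        [("Cmd", "show flights from"), ("Pre", "")],
    ],
    [
        [("Cmd", "list"), ("Pre", "from")],
    ],
]


def em_rule_probs(examples, iterations=10):
    """Estimate P(rhs | lhs) for every enumerated rewrite rule with EM."""
    # Assumed initialization: uniform over the right-hand sides of each lhs.
    rhs_by_lhs = defaultdict(set)
    for candidates in examples:
        for segmentation in candidates:
            for lhs, rhs in segmentation:
                rhs_by_lhs[lhs].add(rhs)
    probs = {
        (lhs, rhs): 1.0 / len(rhs_set)
        for lhs, rhs_set in rhs_by_lhs.items()
        for rhs in rhs_set
    }

    for _ in range(iterations):
        # E-step: weight each candidate segmentation by its posterior
        # probability and accumulate expected rule counts.
        counts = defaultdict(float)
        for candidates in examples:
            scores = [prod(probs[rule] for rule in seg) for seg in candidates]
            total = sum(scores)
            for seg, score in zip(candidates, scores):
                for rule in seg:
                    counts[rule] += score / total

        # M-step: renormalize the expected counts per left-hand side.
        lhs_totals = defaultdict(float)
        for (lhs, _), count in counts.items():
            lhs_totals[lhs] += count
        probs = {rule: count / lhs_totals[rule[0]] for rule, count in counts.items()}
    return probs


def resolve(candidates, probs):
    """Return the index of the most probable candidate segmentation."""
    scores = [prod(probs[rule] for rule in seg) for seg in candidates]
    return max(range(len(candidates)), key=scores.__getitem__)


probs = em_rule_probs(EXAMPLES)
best = resolve(EXAMPLES[0], probs)
print("chosen segmentation:", EXAMPLES[0][best])
```

In this toy setup the unambiguous second example supplies evidence for the rule `Pre -> "from"`, so over successive iterations EM shifts probability toward the first candidate of the ambiguous example.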