Apparatus and method for estimating, from sparse data, the probability that a particular one of a set of events is the next event in a string of events

作者: Slava M. Katz

DOI:

关键词:

摘要: Apparatus and method for evaluating the likelihood of an event (such as a word) following string known events, based on sequence counts derived from sparse sample data. Event sequences--or m-grams--include key subsequent event. For each m-gram is stored discounted probability generated by applying modified Turing's estimate, example, to count-based probability. occurring in data there normalization constant which preferably (a) adjusts probabilities multiple counting, if any, (b) includes freed mass allocated m-grams do not occur To determine selected "backing off" scheme employed successively shorter keys (of events) followed (representing m-grams) are searched until found having therefor. The constants longer keys--for corresponding have no probability--are combined together with produce being next.

参考文章(12)
John Joseph Hilliard, Walter Steven Rosenbaum, Anne Marie Chaires, Jean Marie Ciconte, Allen Harold Ett, Binary reference matrix for a character recognition machine ,(1974)
Masaaki Honda, Nobuhiko Kitawaki, Fumitada Itakura, Adaptive predictive processing system ,(1982)
Stephen E. Levinson, Syntactic word recognizer Journal of the Acoustical Society of America. ,vol. 73, pp. 2247- 2247 ,(1977) , 10.1121/1.389482
Tadao Nojiri, Continuous speech recognition method and device The Journal of the Acoustical Society of America. ,vol. 83, pp. 405- 405 ,(1988) , 10.1121/1.396214
Stephen L. Moshier, Method and apparatus for continuous word string recognition Journal of the Acoustical Society of America. ,vol. 84, pp. 2306- 2306 ,(1981) , 10.1121/1.396758
Cory S. Myers, Continuous speech pattern recognizer Journal of the Acoustical Society of America. ,vol. 82, pp. 1863- 1863 ,(1981) , 10.1121/1.395726
Stephen E. Levinson, Frank C. Pirz, Syntactic continuous speech recognizer The Journal of the Acoustical Society of America. ,vol. 78, pp. 1930- 1930 ,(1985) , 10.1121/1.392680
F. Jelinek, The development of an experimental discrete dictation recognizer Proceedings of the IEEE. ,vol. 73, pp. 587- 595 ,(1985) , 10.1109/PROC.1985.13343