作者: Y. Gotoh , S. Renals , G. Williams
DOI: 10.1109/ICASSP.1999.758175
关键词:
摘要: We introduce named entity (NE) language modelling, a stochastic finite state machine approach to identifying both words and NE categories from stream of spoken data. provide an overview our tagged model (LM) generation together with results the application such LM task out-of-vocabulary (OOV) word reduction in large vocabulary speech recognition. Using Wall Street Journal Broadcast News corpora, it is shown that was able reduce overall error rate by 14%, detecting up 70% previously OOV words. also describe example direct tagging data categories.