Method and system for building a topic specific language model for use in automatic speech recognition

作者: Feng Rao , Eryu Wang , Shuai Yue , Lou Li , Duling Lu

DOI:

关键词: Language modelSpeech corpusFactored language modelSpeech recognitionNatural language processingArtificial intelligenceCache language modelComputer scienceSpeech analyticsSpeech processingAcoustic modeln-gram

摘要: An automatic speech recognition method includes at a computer having one or more processors and memory for storing programs to be executed by the processors, obtaining plurality of corpus categories through classifying calculating raw corpus; classified language models that respectively correspond model training applied on each category; an interpolation implementing weighted merging interpolated models; constructing decoding resource in accordance with acoustic model; input using resource, outputting character string highest probability as result speech.

参考文章(37)
Kristie Seymore, Ronald Rosenfeld, Large-Scale Topic Detection and Language Model Adaptation. Defense Technical Information Center. ,(1997) , 10.21236/ADA327553
Roland Kuhn, George Foster, Means and Method for Adapted Language Translation ,(2006)
Alejandro Acero, Yik-Cheung Tam, Milind Mahajan, Ciprian Chelba, Language model adaptation using semantic supervision ,(2004)
Brandon M. Ballinger, Michael H. Cohen, Johan Schalkwyk, Cyril Georges Luc Allauzen, Language Model Selection for Speech-to-Text Conversion ,(2010)
Benyu Zhang, HuaJun Zeng, Zheng Chen, Jian Wang, Jilin Chen, Diverse topic phrase extraction ,(2007)
Carl Joseph Kraenzel, Baiju Dhirajlal Mandalia, David M. Lubensky, Dialog filtering for filling out a form ,(2008)
Monika Woszczyna, Detlef Koll, Michael Finke, Girija Yegnanarayanan, Juergen Fritsch, Document Transcription System Training ,(2005)
Jianfeng Gao, Xiaodong He, Amittai Axelrod, Selection of domain-adapted translation subcorpora ,(2011)