Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System

作者: Ananlada Chotimongkol , Chai Wutiwiwatchai , Kwanchiva Saykham

DOI:

关键词:

摘要: This paper investigates the effectiveness of online temporal language model adaptation when applied to a Thai broadcast news transcription task. Our scheme works as follow: first an initial is trained with available during development period. Then adapted over time more recent and articles deployment especially data from same period speech being recognized. We found that are closer in similar terms perplexity suitable for adaptation. The LMs better, both WER, than static LM only set data. Adaptation improved by 38.3% WER 7.1% relatively. Though, achieved less improvement, it still useful resource can be obtained automatically. Better pre-processing techniques selection based on text similarity could obtain further improvement this promising result.

参考文章(10)
Ausdang Thangthai, Anocha Rugchatjaroen, Sittipong Saychum, Chai Wutiwiwatchai, A learning method for Thai phonetization of English words. conference of the international speech communication association. pp. 1777- 1780 ,(2007)
E. W. D. Whittaker, Temporal Adaptation of Language Models ,(2004)
Markpong Jongtaveesataporn, Sadaoki Furui, Koji Iwano, Chai Wutiwiwatchai, Thai Broadcast News Corpus Construction and Evaluation language resources and evaluation. ,(2008)
Roger K. Moore, Computer Speech and Language Elsevier Publishing Company. ,(1986)
Ronald Rosenfeld, Philip Clarkson, Statistical Language Modeling using the CMU-Cambridge Toolkit conference of the international speech communication association. ,(1997)
Marcello Federico, Nicola Bertoldi, Broadcast news LM adaptation over time Computer Speech & Language. ,vol. 18, pp. 417- 435 ,(2004) , 10.1016/J.CSL.2003.10.001
Hiroyuki Segi, Atsushi Matsui, Toru Imai, Nippon Hoso Kyokai, Akio Ando, Akio Kobayashi, Speech Recognition of Broadcast Sports News NHK laboratories note. ,(2001)
Ananlada Chotimongkol, Kwanchiva Saykhum, Patcharika Chootrakool, Nattanun Thatphithakkul, Chai Wutiwiwatchai, LOTUS-BN: A Thai broadcast news corpus and its research applications 2009 Oriental COCOSDA International Conference on Speech Database and Assessments. pp. 44- 50 ,(2009) , 10.1109/ICSDA.2009.5278377
Gilles Adda, Lori Lamel, Jean-Luc Gauvain, Langzhou Chen, Dynamic language modeling for broadcast news. conference of the international speech communication association. ,(2004)
Richard Shillcock, Ellen Gurman Bard, R. J. Lickley, Proceedings of EUROSPEECH-1991. Istituto Internazionale delle Comunicazioni. ,(1991)