Multi-style adaptive training for robust cross-lingual spoken language understanding

Authors: Xiaodong He, Li Deng, Dilek Hakkani-Tur, Gokhan Tur

DOI: 10.1109/ICASSP.2013.6639292

Keywords:

Abstract: Given the increasingly available machine translation (MT) services nowadays, one efficient strategy for cross-lingual spoken language understanding (SLU) is to first translate the input utterance from the second language into the primary language, and then call the primary-language SLU system to decode the semantic knowledge. However, errors introduced in the MT process create a condition similar to the “mismatch” condition encountered in robust speech recognition. Such mismatch makes the performance of cross-lingual SLU far from acceptable. Motivated by successful solutions developed in robust speech recognition, in this paper we propose a multi-style adaptive training method to improve the robustness of the SLU system for cross-lingual SLU tasks. For evaluation, we created an English-Chinese bilingual ATIS database and carried out a series of experiments on that database to assess the proposed methods. Experimental results show that, without relying on any data in the second language, the proposed method significantly improves performance on the cross-lingual SLU task while producing no degradation on the primary-language SLU task. This greatly facilitates porting SLU to as many languages as there are MT systems, without human effort. We further study the robustness of this approach to another type of mismatch condition, caused by speech recognition errors, and demonstrate its success there also.
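The multi-style idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the distorted training data is produced or which SLU model is used. Here `toy_mt_noise` is a hypothetical stand-in for MT errors, and the bag-of-words intent classifier is a toy substitute for a real SLU system. The sketch only shows the general pattern of pooling clean utterances with distorted copies that keep the same labels, so the model sees both "styles" during training.

```python
from collections import Counter, defaultdict
from typing import Callable, Dict, List, Tuple

Example = Tuple[str, str]  # (utterance text, intent label)


def build_multi_style_set(clean: List[Example],
                          distort: Callable[[str], str]) -> List[Example]:
    """Pool each clean utterance with a distorted copy that keeps the same label."""
    noisy = [(distort(text), label) for text, label in clean]
    return clean + noisy


def train_bag_of_words(examples: List[Example]) -> Dict[str, Counter]:
    """Count word occurrences per intent (toy stand-in for a real SLU model)."""
    model: Dict[str, Counter] = defaultdict(Counter)
    for text, label in examples:
        model[label].update(text.lower().split())
    return dict(model)


def classify(model: Dict[str, Counter], text: str) -> str:
    """Return the intent whose training words overlap most with the input."""
    words = text.lower().split()
    return max(sorted(model), key=lambda label: sum(model[label][w] for w in words))


def toy_mt_noise(text: str) -> str:
    """Hypothetical MT-style distortion: swap a few words for near-synonyms."""
    swaps = {"flights": "airplanes", "fare": "price", "show": "display"}
    return " ".join(swaps.get(w, w) for w in text.split())


# Primary-language (clean) training data, ATIS-like but invented for this sketch.
clean_train = [
    ("show flights from boston to denver", "flight"),
    ("what is the fare to dallas", "airfare"),
]

model_clean = train_bag_of_words(clean_train)
model_multi = train_bag_of_words(build_multi_style_set(clean_train, toy_mt_noise))

# An input as it might look after imperfect translation into the primary language.
translated_query = "display airplanes please"
print("clean-only model :", classify(model_clean, translated_query))
print("multi-style model:", classify(model_multi, translated_query))
```

In this toy setup the clean-only model has never seen the distorted wording and can only break a tie, while the multi-style model matches the distorted words directly and still classifies the clean utterances as before.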

References (28)
Bassam Jabaian, Fabrice Lefèvre, Laurent Besacier. Investigating multiple approaches for SLU portability to a new language. Conference of the International Speech Communication Association, pp. 2502-2505 (2010).
Li Deng. Front-End, Back-End, and Hybrid Techniques for Noise-Robust Speech Recognition. Robust Speech Recognition of Uncertain or Missing Data, pp. 67-99 (2011). DOI: 10.1007/978-3-642-21317-5_4
Xiaodong He, Xin Lei, Jon Hamaker. Robust feature space adaptation for telephony speech recognition. Conference of the International Speech Communication Association (2006).
François Mairesse, Steve J. Young, Fabrice Lefèvre. Cross-Lingual spoken language understanding from unaligned data using discriminative classification models and machine translation. Conference of the International Speech Communication Association, pp. 78-81 (2010).
Jianfeng Gao, Xiaodong He, Amittai Axelrod. Domain Adaptation via Pseudo In-Domain Data Selection. Empirical Methods in Natural Language Processing, pp. 355-362 (2011).
Stephanie Seneff. TINA: a natural language system for spoken language applications. Computational Linguistics, vol. 18, pp. 61-86 (1992).
Christophe Servan, Nathalie Camelin, Christian Raymond, Frederic Bechet, Renato De Mori. On the use of machine translation for spoken language understanding portability. International Conference on Acoustics, Speech, and Signal Processing, pp. 5330-5333 (2010). DOI: 10.1109/ICASSP.2010.5494960
Xiaodong He, Li Deng, Wu Chou. Discriminative learning in sequential pattern recognition. IEEE Signal Processing Magazine, vol. 25, pp. 14-36 (2008). DOI: 10.1109/MSP.2008.926652