Speech-language Pre-training for End-to-end Spoken Language Understanding

作者: Yao Qian , Naoyuki Kanda , Yu Shi , Michael Zeng , Leo Shen

DOI:

关键词:

摘要: End-to-end (E2E) spoken language understanding (SLU) can infer semantics directly from speech signal without cascading an automatic speech recognizer (ASR) with a natural …

参考文章(23)
Yoshua Bengio, Dmitriy Serdyuk, Jan Chorowski, Kyunghyun Cho, Dzmitry Bahdanau, Attention-based models for speech recognition neural information processing systems. ,vol. 28, pp. 577- 585 ,(2015)
Charles T. Hemphill, John J. Godfrey, George R. Doddington, The ATIS spoken language systems pilot corpus human language technology. pp. 96- 101 ,(1990) , 10.3115/116580.116613
Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, Łukasz Kaiser, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa, Keith Stevens, George Kurian, Nishant Patil, Wei Wang, Cliff Young, Jason Smith, Jason Riesa, Alex Rudnick, Oriol Vinyals, Greg Corrado, Macduff Hughes, Jeffrey Dean, None, Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation arXiv: Computation and Language. ,(2016)
Yao Qian, Rutuja Ubale, Vikram Ramanaryanan, Patrick Lange, David Suendermann-Oeft, Keelan Evanini, Eugene Tsuprun, Exploring ASR-free end-to-end modeling to improve spoken language understanding in a cloud-based dialog system 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). pp. 569- 576 ,(2017) , 10.1109/ASRU.2017.8268987
Chih-Wen Goo, Guang Gao, Yun-Kai Hsu, Chih-Li Huo, Tsung-Chieh Chen, Keng-Wei Hsu, Yun-Nung Chen, Slot-gated modeling for joint slot filling and intent prediction north american chapter of the association for computational linguistics. ,vol. 2, pp. 753- 757 ,(2018) , 10.18653/V1/N18-2118
Yuan-Ping Chen, Ryan Price, Srinivas Bangalore, Spoken Language Understanding without Speech Recognition international conference on acoustics, speech, and signal processing. pp. 6189- 6193 ,(2018) , 10.1109/ICASSP.2018.8461718
Parisa Haghani, Arun Narayanan, Michiel Bacchiani, Galen Chuang, Neeraj Gaur, Pedro Moreno, Rohit Prabhavalkar, Zhongdi Qu, Austin Waters, From Audio to Semantics: Approaches to End-to-End Spoken Language Understanding 2018 IEEE Spoken Language Technology Workshop (SLT). pp. 720- 726 ,(2018) , 10.1109/SLT.2018.8639043
Wen Wang, Qian Chen, Zhu Zhuo, BERT for Joint Intent Classification and Slot Filling arXiv: Computation and Language. ,(2019)
Yoshua Bengio, Vikrant Singh Tomar, Mirco Ravanelli, Loren Lugosch, Patrick Ignoto, Speech Model Pre-training for End-to-End Spoken Language Understanding arXiv: Audio and Speech Processing. ,(2019)
Richard Socher, Ehsan Hosseini-Asl, Caiming Xiong, Pascale Fung, Chien-Sheng Wu, Andrea Madotto, Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems arXiv: Computation and Language. ,(2019)