Speech-language Pre-training for End-to-end Spoken Language Understanding

作者： Yao Qian , Naoyuki Kanda , Yu Shi , Michael Zeng , Leo Shen

DOI:

关键词:

摘要: End-to-end (E2E) spoken language understanding (SLU) can infer semantics directly from speech signal without cascading an automatic speech recognizer (ASR) with a natural …

arxiv.org 本地加速

ieee.org 本地加速

arxiv.org PDF 下载加速

参考文章(23)

Yoshua Bengio, Dmitriy Serdyuk, Jan Chorowski, Kyunghyun Cho, Dzmitry Bahdanau, Attention-based models for speech recognition neural information processing systems. ,vol. 28, pp. 577- 585 ,(2015)

Charles T. Hemphill, John J. Godfrey, George R. Doddington, The ATIS spoken language systems pilot corpus human language technology. pp. 96- 101 ,(1990) , 10.3115/116580.116613

Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, Łukasz Kaiser, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa, Keith Stevens, George Kurian, Nishant Patil, Wei Wang, Cliff Young, Jason Smith, Jason Riesa, Alex Rudnick, Oriol Vinyals, Greg Corrado, Macduff Hughes, Jeffrey Dean, None, Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation arXiv: Computation and Language. ,(2016)

Yao Qian, Rutuja Ubale, Vikram Ramanaryanan, Patrick Lange, David Suendermann-Oeft, Keelan Evanini, Eugene Tsuprun, Exploring ASR-free end-to-end modeling to improve spoken language understanding in a cloud-based dialog system 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). pp. 569- 576 ,(2017) , 10.1109/ASRU.2017.8268987

Chih-Wen Goo, Guang Gao, Yun-Kai Hsu, Chih-Li Huo, Tsung-Chieh Chen, Keng-Wei Hsu, Yun-Nung Chen, Slot-gated modeling for joint slot filling and intent prediction north american chapter of the association for computational linguistics. ,vol. 2, pp. 753- 757 ,(2018) , 10.18653/V1/N18-2118

Yuan-Ping Chen, Ryan Price, Srinivas Bangalore, Spoken Language Understanding without Speech Recognition international conference on acoustics, speech, and signal processing. pp. 6189- 6193 ,(2018) , 10.1109/ICASSP.2018.8461718

Parisa Haghani, Arun Narayanan, Michiel Bacchiani, Galen Chuang, Neeraj Gaur, Pedro Moreno, Rohit Prabhavalkar, Zhongdi Qu, Austin Waters, From Audio to Semantics: Approaches to End-to-End Spoken Language Understanding 2018 IEEE Spoken Language Technology Workshop (SLT). pp. 720- 726 ,(2018) , 10.1109/SLT.2018.8639043

Wen Wang, Qian Chen, Zhu Zhuo, BERT for Joint Intent Classification and Slot Filling arXiv: Computation and Language. ,(2019)

Yoshua Bengio, Vikrant Singh Tomar, Mirco Ravanelli, Loren Lugosch, Patrick Ignoto, Speech Model Pre-training for End-to-End Spoken Language Understanding arXiv: Audio and Speech Processing. ,(2019)

10.

Richard Socher, Ehsan Hosseini-Asl, Caiming Xiong, Pascale Fung, Chien-Sheng Wu, Andrea Madotto, Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems arXiv: Computation and Language. ,(2019)

Speech-language Pre-training for End-to-end Spoken Language Understanding

来源期刊

我的账户

Speech-language Pre-training for End-to-end Spoken Language Understanding

来源期刊

相似文章 3

Pre-training for Spoken Language Understanding with Joint Textual and Phonetic Representation Learning.

Integration of Pre-trained Networks with Continuous Token Interface for End-to-End Spoken Language Understanding.

Speech2Slot: An End-to-End Knowledge-based Slot Filling from Speech.

我的账户