CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings

Jan Trmal, Sanjeev Khudanpur, Takuya Yoshioka, Yusuke Fujita
The 6th International Workshop on Speech Processing in Everyday Environments (CHiME 2020)

192
2020
ESPnet-SE: end-to-end speech enhancement and separation toolkit designed for ASR integration

Shinji Watanabe, Zhuo Chen, Tomoki Hayashi, Christoph Boeddeker
arXiv: Audio and Speech Processing

44
2020
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals

Yusuke Fujita, Bo Xu, Shinji Watanabe, Jiaming Xu
Unknown Journal

20
2020
End-to-end multi-speaker speech recognition with transformer

Yanmin Qian, Jonathan Le Roux, Shinji Watanabe, Xuankai Chang
Unknown Journal

62
2020
SUPERB: Speech processing Universal PERformance Benchmark

Shang-Wen Li, Shinji Watanabe, Hung-yi Lee, Xuankai Chang
arXiv: Computation and Language

279
2021
Hypothesis Stitcher for End-to-End Speaker-Attributed ASR on Long-Form Multi-Talker Recordings

Takuya Yoshioka, Xiaofei Wang, Yashesh Gaur, Naoyuki Kanda
International Conference on Acoustics, Speech, and Signal Processing

6
2021
Recent Developments on ESPnet Toolkit Boosted By Conformer

Daniel Garcia-Romero, Shinji Watanabe, Tomoki Hayashi, Hirofumi Inaguma
International Conference on Acoustics, Speech, and Signal Processing

159
2021
End-to-end Monaural Multi-speaker ASR System without Pretraining

Xuankai Chang, Yanmin Qian, Kai Yu, Shinji Watanabe
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 6256-6260

63
2019
Improving End-to-End Single-Channel Multi-Talker Speech Recognition

Wangyou Zhang, Xuankai Chang, Yanmin Qian, Shinji Watanabe
IEEE Transactions on Audio, Speech, and Language Processing 28 1385-1394

16
2020
End-to-end ASR with adaptive span self-attention

Xuankai Chang, Aswin Shanmugam Subramanian, Pengcheng Guo, Shinji Watanabe
Conference of the International Speech Communication Association 3595-3599

9
2020
Adaptive Permutation Invariant Training with Auxiliary Information for Monaural Multi-Talker Speech Recognition

Xuankai Chang, Yanmin Qian, Dong Yu
International Conference on Acoustics, Speech, and Signal Processing 5974-5978

10
2018
Single-channel multi-talker speech recognition with permutation invariant training

Yanmin Qian, Xuankai Chang, Dong Yu
Speech Communication 104 1-11

38
2018
MIMO-Speech: End-to-End Multi-Channel Multi-Speaker Speech Recognition

Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) 237-244

82
2019
Train from scratch: Single-stage joint training of speech separation and recognition

Jing Shi, Xuankai Chang, Shinji Watanabe, Bo Xu
Computer Speech & Language 76 101387

2
2022
Joint speech recognition and audio captioning

Chaitanya Narisetty, Emiru Tsunoo, Xuankai Chang, Yosuke Kashiwagi
Smpte Journal 7892-7896

1
2022
SUPERB-SG: Enhanced speech processing universal performance benchmark for semantic and generative capabilities

Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang
arXiv preprint arXiv:2203.06849

24
2022
An exploration of self-supervised pretrained representations for end-to-end speech recognition

Xuankai Chang, Takashi Maekaku, Pengcheng Guo, Jing Shi
Smpte Journal 228-235

30
2021
ESPnet-SLU: Advancing spoken language understanding through ESPnet

Siddhant Arora, Siddharth Dalmia, Pavel Denisov, Xuankai Chang
Smpte Journal 7167-7171

22
2022
ESPnet-SE++: Speech enhancement for robust speech recognition, translation, and understanding

Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang
arXiv preprint arXiv:2207.09514

6
2022
End-to-end integration of speech recognition, speech enhancement, and self-supervised learning representation

Xuankai Chang, Takashi Maekaku, Yuya Fujita, Shinji Watanabe
arXiv preprint arXiv:2204.00540

7
2022