An End-to-End Approach to Joint Social Signal Detection and Automatic Speech Recognition

Hirofumi Lnaguma , Masato Mimura , Koji Inoue , Kazuyoshi Yoshii
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 6214 -6218

2018
End-to-End モデルによる Social Signals 検出および音声認識との統合

HIROFUMI INAGUMA , KOJI INOUE , MASATO MIMURA , TATSUYA KAWAHARA
情報処理学会研究報告 (Web) 2017 ( SLP-117)

2017
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR

Shinsuke Sakai , Tatsuya Kawahara , Masato Mimura , Hirofumi Inaguma
arXiv: Computation and Language

8
2019
Multilingual End-to-End Speech Translation

Kevin Duh , Tatsuya Kawahara , Shinji Watanabe , Hirofumi Inaguma
arXiv: Computation and Language

61
2019
Enhancing Monotonic Multihead Attention for Streaming ASR

Tatsuya Kawahara , Masato Mimura , Hirofumi Inaguma
arXiv: Audio and Speech Processing

27
2020
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder

Kevin Duh , Tatsuya Kawahara , Shinji Watanabe , Hirofumi Inaguma
arXiv: Computation and Language

16
2020
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition.

Tatsuya Kawahara , Hirofumi Inaguma
arXiv: Audio and Speech Processing

10
2021
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation.

Tatsuya Kawahara , Shinji Watanabe , Hirofumi Inaguma
arXiv: Computation and Language

17
2021
Improved Mask-CTC for Non-Autoregressive End-to-End ASR

Shinji Watanabe , Tetsuji Ogawa , Hirofumi Inaguma , Yosuke Higuchi
international conference on acoustics speech and signal processing

2021
Recent Developments on Espnet Toolkit Boosted By Conformer

Daniel Garcia-Romero , Shinji Watanabe , Tomoki Hayashi , Hirofumi Inaguma
international conference on acoustics speech and signal processing

159
2021
Acoustic-to-Word Attention-Based Model Complemented with Character-Level CTC-Based Model

Sei Ueno , Hirofumi Inaguma , Masato Mimura , Tatsuya Kawahara
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 5804 -5808

21
2018
Transfer Learning of Language-independent End-to-end ASR with Language Model Fusion

Hirofumi Inaguma , Jaejin Cho , Murali Karthick Baskar , Tatsuya Kawahara
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 6096 -6100

14
2019
Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition

Jaejin Cho , Shinji Watanabe , Takaaki Hori , Murali Karthick Baskar
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 6191 -6195

5
2019
End-to-End Speech-to-Dialog-Act Recognition

Viet-Trung Dang , Tianyu Zhao , Sei Ueno , Hirofumi Inaguma
Interspeech 2020 3910 -3914

8
2020
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR.

Hayato Futami , Hirofumi Inaguma , Sei Ueno , Masato Mimura
conference of the international speech communication association 3635 -3639

37
2020
Leveraging Sequence-to-Sequence Speech Synthesis for Enhancing Acoustic-to-Word Speech Recognition

Masato Mimura , Sei Ueno , Hirofumi Inaguma , Shinsuke Sakai
2018 IEEE Spoken Language Technology Workshop (SLT) 477 -484

40
2018
ESPnet-ST: All-in-One Speech Translation Toolkit

Hirofumi Inaguma , Shun Kiyono , Kevin Duh , Shigeki Karita
meeting of the association for computational linguistics 302 -311

114
2020
Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR

Hirofumi Inaguma , Yashesh Gaur , Liang Lu , Jinyu Li
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 6064 -6068

40
2020
Prediction of ice-breaking between participants using prosodic features in the first meeting dialogue

Hirofumi Inaguma , Koji Inoue , Shizuka Nakamura , Katsuya Takanashi
Proceedings of the 2nd Workshop on Advancements in Social Signal Processing for Multimodal Interaction 11 -15

4
2016