Hirofumi Inaguma

Affiliation: Fundamental AI Research (FAIR) at Meta

Homepage: hirofumi0810.github.io

Citations

Citations: 2,206

h-index: 20

i10-index: 29

Publications: 52

Title

Citations

Year

An End-to-End Approach to Joint Social Signal Detection and Automatic Speech Recognition

Hirofumi Inaguma , Masato Mimura , Koji Inoue , Kazuyoshi Yoshii
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 6214-6218

2018

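Social Signals Detection with an End-to-End Model and Its Integration with Speech Recognition [End-to-End モデルによる Social Signals 検出および音声認識との統合]

Hirofumi Inaguma , Koji Inoue , Masato Mimura , Tatsuya Kawahara
IPSJ SIG Technical Reports (Web) 2017 (SLP-117)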

2017

Joint Social Signal Detection and Automatic Speech Recognition based on End-to-End Modeling and Multi-task Learning

Hirofumi Inaguma

Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR

Shinsuke Sakai , Tatsuya Kawahara , Masato Mimura , Hirofumi Inaguma
arXiv: Computation and Language

2019

Multilingual End-to-End Speech Translation

Kevin Duh , Tatsuya Kawahara , Shinji Watanabe , Hirofumi Inaguma
arXiv: Computation and Language

2019

Enhancing Monotonic Multihead Attention for Streaming ASR

Tatsuya Kawahara , Masato Mimura , Hirofumi Inaguma
arXiv: Audio and Speech Processing

2020

Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder

Kevin Duh , Tatsuya Kawahara , Shinji Watanabe , Hirofumi Inaguma
arXiv: Computation and Language

2020

Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition

Tatsuya Kawahara , Hirofumi Inaguma
arXiv: Audio and Speech Processing

2021

Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation

Tatsuya Kawahara , Shinji Watanabe , Hirofumi Inaguma
arXiv: Computation and Language

2021

Improved Mask-CTC for Non-Autoregressive End-to-End ASR

Shinji Watanabe , Tetsuji Ogawa , Hirofumi Inaguma , Yosuke Higuchi
International Conference on Acoustics, Speech and Signal Processing

2021

Recent Developments on ESPnet Toolkit Boosted by Conformer

Daniel Garcia-Romero , Shinji Watanabe , Tomoki Hayashi , Hirofumi Inaguma
International Conference on Acoustics, Speech and Signal Processing

159

2021

Acoustic-to-Word Attention-Based Model Complemented with Character-Level CTC-Based Model

Sei Ueno , Hirofumi Inaguma , Masato Mimura , Tatsuya Kawahara
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 5804-5808

2018

Transfer Learning of Language-independent End-to-end ASR with Language Model Fusion

Hirofumi Inaguma , Jaejin Cho , Murali Karthick Baskar , Tatsuya Kawahara
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 6096-6100

2019

Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition

Jaejin Cho , Shinji Watanabe , Takaaki Hori , Murali Karthick Baskar
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 6191-6195

2019

End-to-End Speech-to-Dialog-Act Recognition

Viet-Trung Dang , Tianyu Zhao , Sei Ueno , Hirofumi Inaguma
Interspeech 2020 3910-3914

2020

Distilling the Knowledge of BERT for Sequence-to-Sequence ASR

Hayato Futami , Hirofumi Inaguma , Sei Ueno , Masato Mimura
Conference of the International Speech Communication Association 3635-3639

2020

Leveraging Sequence-to-Sequence Speech Synthesis for Enhancing Acoustic-to-Word Speech Recognition

Masato Mimura , Sei Ueno , Hirofumi Inaguma , Shinsuke Sakai
2018 IEEE Spoken Language Technology Workshop (SLT) 477-484

2018

ESPnet-ST: All-in-One Speech Translation Toolkit

Hirofumi Inaguma , Shun Kiyono , Kevin Duh , Shigeki Karita
Meeting of the Association for Computational Linguistics 302-311

114

2020

Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR

Hirofumi Inaguma , Yashesh Gaur , Liang Lu , Jinyu Li
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 6064-6068

2020

Prediction of ice-breaking between participants using prosodic features in the first meeting dialogue

Hirofumi Inaguma , Koji Inoue , Shizuka Nakamura , Katsuya Takanashi
Proceedings of the 2nd Workshop on Advancements in Social Signal Processing for Multimodal Interaction 11-15

2016

Speech recognition

Speech translation
