CENSREC-1-AV: An audio-visual corpus for noisy bimodal speech recognition

Satoshi Tamura , Chiyomi Miyajima , Norihide Kitaoka , Takeshi Yamada
Training 720 480 -480

51
2010
Feature-Dependent Allophone Clustering

Shigeki Sagayama , Hiroshi Shimodaira , Shigeki Matsuda , Mitsuru Nakai
conference of the international speech communication association 413 -416

4
2000
Similarity Based Language Model Construction for Voice Activated Open-Domain Question Answering

Teruhisa Misu , Kiyonori Ohtake , Shigeki Matsuda , Stijn De Saeger
international joint conference on natural language processing 536 -544

7
2011
Speech restoration based on deep learning autoencoder with layer-wised pretraining.

Hideki Kashioka , Chiori Hori , Xugang Lu , Shigeki Matsuda
conference of the international speech communication association 1504 -1507

33
2012
Speech enhancement based on deep denoising autoencoder.

Chiori Hori , Yu Tsao , Xugang Lu , Shigeki Matsuda
conference of the international speech communication association 436 -440

872
2013
Multi-Class Support Vector Machine based on Minimum Classification Error Criterion

Chiori Hori , Miho Ohsaki , Shigeru Katagiri , Hideyuki Watanabe
Technical report of IEICE. PRMU 113 ( 402) 13 -18

2014
Distributed speech translation technologies for multiparty multilingual communication

Sakriani Sakti , Michael Paul , Andrew Finch , Xinhui Hu
ACM Transactions on Speech and Language Processing 9 ( 2) 1 -27

3
2012
Speaker adaptive training for deep neural networks embedding linear transformation networks

Tsubasa Ochiai , Shigeki Matsuda , Hideyuki Watanabe , Xugang Lu
international conference on acoustics, speech, and signal processing 4605 -4609

14
2015
SPARSE REPRESENTATION BASED ON A BAG OF SPECTRAL EXEMPLARS FOR ACOUSTIC EVENT DETECTION

Xugang Lu , Yu Tsao , Shigeki Matsuda , Chiori Hori
international conference on acoustics, speech, and signal processing 6255 -6259

29
2014
SPEAKER ADAPTIVE TRAINING USING DEEP NEURAL NETWORKS

Tsubasa Ochiai , Shigeki Matsuda , Xugang Lu , Chiori Hori
international conference on acoustics, speech, and signal processing 6349 -6353

52
2014
Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation

Yu Tsao , Xugang Lu , Paul Dixon , Ting-yao Hu
Computer Speech & Language 28 ( 3) 709 -726

9
2014
Acoustic space partition based on broad phonetic class for ensemble acoustic modeling

Xugang Lu , Yu Tsao , Shigeki Matsuda , Chiori Hori
international symposium on chinese spoken language processing 311 -314

2012
Controlling the tradeoff property in a regularization framework for noise reduction

Xugang Lu , Masashi Unoki , Shigeki Matsuda , Chiori Hori
international symposium on chinese spoken language processing 201 -205

2012
Collecting sentences from web resources for constructing spontaneous Chinese language model

Xinhui Hu , Youzheng Wu , Shigeki Matsuda , Chiori Hori
international symposium on chinese spoken language processing 197 -200

1
2012
Temporal modulation normalization for robust speech feature extraction and recognition

Xugang Lu , Shigeki Matsuda , Masashi Unoki , Satoshi Nakamura
Multimedia Tools and Applications 52 ( 1) 187 -199

6
2011
Minimum Classification Error Training Incorporating Automatic Loss Smoothness Determination

Hideyuki Watanabe , Jun’ichi Tokuno , Tsukasa Ohashi , Shigeru Katagiri
Journal of Signal Processing Systems 74 ( 3) 311 -322

2014
Robust and Efficient Pattern Classification using Large Geometric Margin Minimum Classification Error Training

Hideyuki Watanabe , Tsukasa Ohashi , Shigeru Katagiri , Miho Ohsaki
Journal of Signal Processing Systems 74 ( 3) 297 -310

6
2014
Ethanol Injection into Granuration Tissue after Tracheostomy.

Kaoru Hamada , Sumito Cho , Masashi Fujimura , Kazuya Fukuoka
Nihon Kikan Shokudoka Gakkai Kaiho 42 ( 3) 284 -288

1991
A Robust Speech Recognition System for Communication Robots in Noisy Environments

Carlos Toshinori Ishi , Shigeki Matsuda , Takayuki Kanda , Takatoshi Jitsuhiro
IEEE Transactions on Robotics 24 ( 3) 759 -763

28
2008
Robot-directed speech detection using Multimodal Semantic Confidence based on speech, image, and motion

Xiang Zuo , Naoto Iwahashi , Ryo Taguchi , Shigeki Matsuda
international conference on acoustics, speech, and signal processing 2458 -2461

11
2010