Channel selection by class separability measures for automatic transcriptions on distant microphones.

作者: Matthias Wölfel

DOI:

关键词:

摘要: Channel selection is important for automatic speech recognition as the signal quality of one channel might be significantly better than those other channels and therefore, microphone array or blind source separation techniques not lead to improvements over best single microphone. The mayor challenge, however, find this particular who leading most accurate classification. In paper we present a novel method, based on class separability, improve multi-source far distance speech-totext transcriptions. Class separability measures have advantage, compared methods such noise ratio (SNR), that they are able evaluate actual features system. We evaluated NISTs RT-07 development set observe significant in word accuracy SNR methods. also used technique evaluation.

参考文章(8)
Christian Fügen, Matthias Wölfel, Shajith Ikbal, John W. McDonough, Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures conference of the international speech communication association. ,(2006)
Y. Shimizu, S. Kajita, K. Takeda, F. Itakura, Speech recognition based on space diversity using distributed multi-microphone international conference on acoustics, speech, and signal processing. ,vol. 3, pp. 1747- 1750 ,(2000) , 10.1109/ICASSP.2000.862090
Matthias Wolfel, Warped-twice minimum variance distortionless response spectral estimation european signal processing conference. pp. 1- 4 ,(2006)
Sebastian Stüker, Christian Fügen, Matthias Wölfel, Shajith Ikbal, Mari Ostendorf, John McDonough, Florian Kraft, Kornel Laskowski, Advances in Lecture Recognition: The ISL RT-06S Evaluation System international conference on spoken language processing. ,(2006)
X. Anguera, C. Woofers, J. Hernando, Speaker diarization for multi-party meetings using acoustic fusion ieee automatic speech recognition and understanding workshop. pp. 426- 431 ,(2005) , 10.1109/ASRU.2005.1566478
R. Haeb-Umbach, Investigations on inter-speaker variability in the feature space international conference on acoustics speech and signal processing. ,vol. 1, pp. 397- 400 ,(1999) , 10.1109/ICASSP.1999.758146
R. Haeb-Umbach, H. Ney, Linear discriminant analysis for improved large vocabulary continuous speech recognition international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 13- 16 ,(1992) , 10.1109/ICASSP.1992.225984
Yasunari Obuchi, Multiple-microphone robust speech recognition using decoder-based channel selection. conference of the international speech communication association. pp. 52- ,(2004)