The reverb challenge: Acommon evaluation framework for dereverberation and recognition of reverberant speech

作者： Keisuke Kinoshita , Marc Delcroix , Takuya Yoshioka , Tomohiro Nakatani , Emanuel Habets

DOI: 10.1109/WASPAA.2013.6701894

关键词:

摘要: … For each room, we generated near and far condition data based on the same allocated subsets. … sets of the original MC-WSJ-AV into near and far conditions, according to the speaker-…

ieee.org 本地加速

uni-paderborn.de 本地加速

academia.edu PDF 下载加速

参考文章(12)

John McDonough, Matthias Woelfel, Distant Speech Recognition ,(2009)

Valtchev, G Evermann, PC Woodland, G Moore, SJ Young, JJ Odell, D Kershaw, D Povey, DG Ollason, Mjf Gales, The HTK book version 3.4 Cambridge University Engineering Department. ,(2006)

Alex Acero, Raj Reddy, Xuedong Huang, Hsiao-Wuen Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development ,(2001)

Takuya Yoshioka, Armin Sehr, Marc Delcroix, Keisuke Kinoshita, Roland Maas, Tomohiro Nakatani, Walter Kellermann, Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition IEEE Signal Processing Magazine. ,vol. 29, pp. 114- 126 ,(2012) , 10.1109/MSP.2012.2205029

Emmanuel Vincent, Shoko Araki, Fabian Theis, Guido Nolte, Pau Bofill, Hiroshi Sawada, Alexey Ozerov, Vikrham Gowreesunker, Dominik Lutter, Ngoc Q.K. Duong, The signal separation evaluation campaign (2007-2010): Achievements and remaining challenges Signal Processing. ,vol. 92, pp. 1928- 1936 ,(2012) , 10.1016/J.SIGPRO.2011.10.007

Jon Barker, Emmanuel Vincent, Ning Ma, Heidi Christensen, Phil Green, The PASCAL CHiME speech separation and recognition challenge Computer Speech & Language. ,vol. 27, pp. 621- 633 ,(2013) , 10.1016/J.CSL.2012.10.004

Tiago H. Falk, Chenxi Zheng, Wai-Yip Chan, A Non-Intrusive Quality and Intelligibility Measure of Reverberant and Dereverberated Speech IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 18, pp. 1766- 1774 ,(2010) , 10.1109/TASL.2010.2052247

T. Robinson, J. Fransen, D. Pye, J. Foote, S. Renals, WSJCAMO: a British English speech corpus for large vocabulary continuous speech recognition international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 81- 84 ,(1995) , 10.1109/ICASSP.1995.479278

Yi Hu, Philipos C. Loizou, Evaluation of Objective Quality Measures for Speech Enhancement IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 16, pp. 229- 238 ,(2008) , 10.1109/TASL.2007.911054

10.

M. Lincoln, I. McCowan, J. Vepa, H.K. Maganti, The multi-channel Wall Street Journal audio visual corpus (MC-WSJ-AV): specification and initial experiments ieee automatic speech recognition and understanding workshop. pp. 357- 362 ,(2005) , 10.1109/ASRU.2005.1566470

The reverb challenge: Acommon evaluation framework for dereverberation and recognition of reverberant speech

来源期刊

我的账户

The reverb challenge: Acommon evaluation framework for dereverberation and recognition of reverberant speech

来源期刊

相似文章 10

我的账户