The reverb challenge: Acommon evaluation framework for dereverberation and recognition of reverberant speech

作者: Keisuke Kinoshita , Marc Delcroix , Takuya Yoshioka , Tomohiro Nakatani , Emanuel Habets

DOI: 10.1109/WASPAA.2013.6701894

关键词:

摘要: … For each room, we generated near and far condition data based on the same allocated subsets. … sets of the original MC-WSJ-AV into near and far conditions, according to the speaker-…

参考文章(12)
John McDonough, Matthias Woelfel, Distant Speech Recognition ,(2009)
Valtchev, G Evermann, PC Woodland, G Moore, SJ Young, JJ Odell, D Kershaw, D Povey, DG Ollason, Mjf Gales, The HTK book version 3.4 Cambridge University Engineering Department. ,(2006)
Alex Acero, Raj Reddy, Xuedong Huang, Hsiao-Wuen Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development ,(2001)
Takuya Yoshioka, Armin Sehr, Marc Delcroix, Keisuke Kinoshita, Roland Maas, Tomohiro Nakatani, Walter Kellermann, Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition IEEE Signal Processing Magazine. ,vol. 29, pp. 114- 126 ,(2012) , 10.1109/MSP.2012.2205029
Emmanuel Vincent, Shoko Araki, Fabian Theis, Guido Nolte, Pau Bofill, Hiroshi Sawada, Alexey Ozerov, Vikrham Gowreesunker, Dominik Lutter, Ngoc Q.K. Duong, The signal separation evaluation campaign (2007-2010): Achievements and remaining challenges Signal Processing. ,vol. 92, pp. 1928- 1936 ,(2012) , 10.1016/J.SIGPRO.2011.10.007
Jon Barker, Emmanuel Vincent, Ning Ma, Heidi Christensen, Phil Green, The PASCAL CHiME speech separation and recognition challenge Computer Speech & Language. ,vol. 27, pp. 621- 633 ,(2013) , 10.1016/J.CSL.2012.10.004
Tiago H. Falk, Chenxi Zheng, Wai-Yip Chan, A Non-Intrusive Quality and Intelligibility Measure of Reverberant and Dereverberated Speech IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 18, pp. 1766- 1774 ,(2010) , 10.1109/TASL.2010.2052247
T. Robinson, J. Fransen, D. Pye, J. Foote, S. Renals, WSJCAMO: a British English speech corpus for large vocabulary continuous speech recognition international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 81- 84 ,(1995) , 10.1109/ICASSP.1995.479278
Yi Hu, Philipos C. Loizou, Evaluation of Objective Quality Measures for Speech Enhancement IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 16, pp. 229- 238 ,(2008) , 10.1109/TASL.2007.911054
M. Lincoln, I. McCowan, J. Vepa, H.K. Maganti, The multi-channel Wall Street Journal audio visual corpus (MC-WSJ-AV): specification and initial experiments ieee automatic speech recognition and understanding workshop. pp. 357- 362 ,(2005) , 10.1109/ASRU.2005.1566470