Missing data mask models with global frequency and temporal constraints

作者: Christophe Cerisara , Jean Paul Haton , Sébastien Demange

DOI:

关键词:

摘要: Missing data recognition has been developped in order to increase noise robustness automatic speech recognition. Many different factors, including the decoding process itself, shall be considered locate masks. In this work, we are considering Bayesian models of masks, where every spectral feature is classified as reliable or masked, and independent from rest signal. This classification strategy can produce unrelated small ``spots'', while experiments suggest that oracle unreliable features tend clustered into time-frequency blocks. We call undesired effect: ``checkerboard'' effect. paper, propose a new missing classifier integrates frequency temporal constraints reduce, avoid, The proposed evaluated on Aurora2 connected digit corpora. Integrating such leads significant improvements accuracy.

参考文章(7)
Hanseok Ko, Richard M. Stern, Wooil Kim, Wooil Kim, Environment-Independent Mask Estimation for Missing-Feature Reconstruction conference of the international speech communication association. pp. 2637- 2640 ,(2005)
Ljubomir Josifovski, Martin Cooke, Phil D. Green, Jon Barker, Soft decisions in missing data techniques for robust automatic speech recognition. conference of the international speech communication association. pp. 373- 376 ,(2000)
Hervé Bourlard, Andrew Morris, Jon Barker, From missing data to maybe useful data: soft data modelling for noise robust ASR Proc. WISP. ,(2001)
Andrew Morris, Data utility modelling for mismatch reduction Proc. CRAC (workshop on Consistent & Reliable Acoustic Cues for sound analysis). ,(2001)
Martin Cooke, Phil Green, Ljubomir Josifovski, Ascension Vizinho, Robust automatic speech recognition with missing and unreliable acoustic data Speech Communication. ,vol. 34, pp. 267- 285 ,(2001) , 10.1016/S0167-6393(00)00034-0
S. Demange, C. Cerisara, J.-P. Haton, Mask Estimation for Missing Data Recognition using Background Noise Sniffing international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 301- 304 ,(2006) , 10.1109/ICASSP.2006.1660017
Hans-Günter Hirsch, David Pearce, THE AURORA EXPERIMENTAL FRAMEWORK FOR THE PERFORMANCE EVALUATION OF SPEECH RECOGNITION SYSTEMS UNDER NOISY CONDITIONS conference of the international speech communication association. ,vol. 4, pp. 29- 32 ,(2000)