The Effect of Memory Inclusion on Mutual Information Between Speech Frequency Bands

作者: A.H. Nour-Eldin , T.Z. Shabestary , P. Kabal

DOI: 10.1109/ICASSP.2006.1660588

关键词:

摘要: In this paper, we investigate the effect of temporal correlation on dependence between speech narrow and high frequency bands covering 0.3–3.4 kHz 3.7–8 ranges, respectively. We follow technique using Gaussian mixture modelling spectral envelopes represented by Mel-frequency cepstral coefficients. The disjoint is quantified through mutual information (MI) its ratio to highband entropy. Speech exhibits considerable that not explicitly accounted for static parametrization envelopes. Including memory in (through delta features) incorporates such modelling, hence, MI gains are be expected resulting bandwidth extension with better performance. Results show exploiting features can increase certainty about (ratio entropy) as much 216% relatively, corresponding an absolute 12%.

参考文章(8)
P. Jax, P. Vary, Artificial bandwidth extension of speech signals using MMSE estimation based on a hidden Markov model international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 680- 683 ,(2003) , 10.1109/ICASSP.2003.1198872
R. Hagen, Spectral quantization of cepstral coefficients international conference on acoustics, speech, and signal processing. pp. 509- 512 ,(1994) , 10.1109/ICASSP.1994.389244
K.K. Paliwal, B.S. Atal, Efficient vector quantization of LPC parameters at 24 bits/frame IEEE Transactions on Speech and Audio Processing. ,vol. 1, pp. 3- 14 ,(1993) , 10.1109/89.221363
Mattias Nilsson, Harald Gustaftson, Soren Vang Andersen, W. Bastiaan Kleijn, Gaussian mixture model based mutual information estimation between frequency bands in speech international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 525- 528 ,(2002) , 10.1109/ICASSP.2002.5743770
P. Jax, P. Vary, Feature selection for improved bandwidth extension of speech signals international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 697- 700 ,(2004) , 10.1109/ICASSP.2004.1326081
Peter Jax, Peter Vary, An upper bound on the quality of artificial bandwidth extension of narrowband speech signals international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 237- 240 ,(2002) , 10.1109/ICASSP.2002.5743698
M. Nilsson, S.V. Andersen, W.B. Kleijn, On the mutual information between frequency bands in speech international conference on acoustics, speech, and signal processing. ,vol. 3, pp. 1327- 1330 ,(2000) , 10.1109/ICASSP.2000.861823
Jax, Vary, An upper bound on the quality of artificial bandwidth extension of narrowband speech signals international conference on acoustics, speech, and signal processing. ,vol. 1, ,(2002) , 10.1109/ICASSP.2002.1005720