GAUSSIAN MIXTURE LINEAR PREDICTION

Authors: Jouni Pohjalainen, Paavo Alku

DOI: 10.1109/ICASSP.2014.6854813

Keywords: Artificial intelligence, Linear prediction, Estimation theory, Distortion, Signal processing, Pattern recognition, Background noise, Fourier transform, Autoregressive model, Mathematics, Gaussian

Abstract: This work introduces an approach to linear predictive signal analysis utilizing a Gaussian mixture autoregressive model. By initializing different states of the model to approximately correspond to the target signal and to the expected type of undesired components, such as background noise, iterative parameter estimation converges towards a prediction model focused on the target signal. Differently initialized and trained variants are evaluated using objective spectrum distortion measures as well as in feature extraction for speech detection in the presence of ambient noise. In these evaluations, the novel methods perform better than the Fourier transform and conventional linear prediction.
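The idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a plain (memoryless) mixture of autoregressive predictors with Gaussian residuals, fitted by expectation-maximization with weighted least squares, and the paper's actual model and update rules may differ. All function names (`lagged_matrix`, `fit_lp`, `em_mixture_lp`) and parameters (`ridge`, `var_floor`, `n_iter`) are hypothetical.

```python
import numpy as np


def lagged_matrix(x, p):
    """Return (X, y): row i of X holds [x[n-1], ..., x[n-p]] and y[i] = x[n], n = p + i."""
    x = np.asarray(x, dtype=float)
    X = np.column_stack([x[p - k: len(x) - k] for k in range(1, p + 1)])
    return X, x[p:]


def fit_lp(x, p, weights=None, ridge=1e-8):
    """Conventional (optionally weighted) least-squares linear prediction of order p."""
    X, y = lagged_matrix(x, p)
    w = np.ones(len(y)) if weights is None else np.asarray(weights, dtype=float)
    G = X.T @ (w[:, None] * X) + ridge * np.eye(p)  # small ridge keeps G invertible
    return np.linalg.solve(G, X.T @ (w * y))


def em_mixture_lp(x, init_coeffs, n_iter=20, ridge=1e-8, var_floor=1e-10):
    """EM for an assumed mixture of K AR(p) predictors with Gaussian residuals.

    init_coeffs has shape (K, p); each row seeds one state, e.g. one row from a
    target-like signal and one from a noise-like signal.
    """
    A = np.array(init_coeffs, dtype=float)
    K, p = A.shape
    X, y = lagged_matrix(x, p)
    E = y[:, None] - X @ A.T                      # residual of each state's predictor
    var = np.maximum(np.mean(E ** 2, axis=0), var_floor)
    pi = np.full(K, 1.0 / K)
    for _ in range(n_iter):
        # E-step: posterior probability of each state for each sample
        E = y[:, None] - X @ A.T
        log_r = np.log(pi) - 0.5 * np.log(2 * np.pi * var) - 0.5 * E ** 2 / var
        log_r -= log_r.max(axis=1, keepdims=True)  # stabilize the exponentials
        r = np.exp(log_r)
        r /= r.sum(axis=1, keepdims=True)
        # M-step: weighted least squares per state, then variances and priors
        for k in range(K):
            wk = r[:, k]
            G = X.T @ (wk[:, None] * X) + ridge * np.eye(p)
            A[k] = np.linalg.solve(G, X.T @ (wk * y))
            ek = y - X @ A[k]
            var[k] = max(np.sum(wk * ek ** 2) / max(wk.sum(), 1e-12), var_floor)
        pi = np.maximum(r.mean(axis=0), 1e-12)
        pi /= pi.sum()
    return A, var, pi, r
```

In this sketch, one row of `init_coeffs` could come from `fit_lp` applied to a clean example of the target signal and another from `fit_lp` applied to a noise recording; after the iterations, the row seeded from the target would serve as the focused predictor. Whether that seeding matches the initialization used in the paper is an assumption.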
