A statistical model-based voice activity detection

作者: Jongseo Sohn , Nam Soo Kim , Wonyong Sung

DOI: 10.1109/97.736233

关键词:

摘要: In this letter, we develop a robust voice activity detector (VAD) for the application to variable-rate speech coding. The developed VAD employs decision-directed parameter estimation method likelihood ratio test. addition, propose an effective hang-over scheme which considers previous observations by first-order Markov process modeling of occurrences. According our simulation results, proposed shows significantly better performances than G.729B in low signal-to-noise (SNR) and vehicular noise environments.

参考文章(5)
Lawrence Rabiner, Biing-Hwang Juang, Fundamentals of speech recognition ,(1993)
K. Srinivasan, A. Gersho, Voice activity detection for cellular networks ieee workshop on speech coding for telecommunications. pp. 85- 86 ,(1993) , 10.1109/SCFT.1993.762351
Y. Ephraim, D. Malah, Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 33, pp. 443- 445 ,(1984) , 10.1109/TASSP.1985.1164550
O. Cappe, Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor IEEE Transactions on Speech and Audio Processing. ,vol. 2, pp. 345- 349 ,(1994) , 10.1109/89.279283
Jongseo Sohn, Wonyong Sung, A voice activity detector employing soft decision based noise spectrum adaptation international conference on acoustics speech and signal processing. ,vol. 1, pp. 365- 368 ,(1998) , 10.1109/ICASSP.1998.674443