作者: J.D. Hoyt , H. Wechsler
DOI: 10.1109/ICASSP.1994.389676
关键词: Feature vector 、 Noise shaping 、 Word error rate 、 Speech processing 、 Speech enhancement 、 Artificial intelligence 、 Computer science 、 Speech recognition 、 Noise 、 Acoustic testing 、 Pattern recognition 、 Formant
摘要: This paper describes research to develop an efficient system that provides a binary decision as the presence of speech in short (one three second) time sample acoustic signal. A method which is and reliably detects human structured noise (such wind, music, traffic sounds, etc.) described. Two separate algorithms were developed. The first algorithm by testing for concave and/or convex formant shapes. second statistical pattern classifier utilizing radial basis function (RBF) networks with mel-cepstra feature vectors. Classification errors are not consistent across these two different methods. As consequence, we plan reduce our error rate fusion >