Magnitude-only estimation of handset nonlinearity with application to speaker recognition

作者: T.F. Quatieri , D.A. Reynolds , G.C. O'Leary

DOI: 10.1109/ICASSP.1998.675372

关键词:

摘要: A method is described for estimating telephone handset nonlinearity by matching the spectral magnitude of distorted signal to output a nonlinear channel model, driven an undistorted reference. This "magnitude-only" representation allows model directly match unwanted speech formants that arise over channels and are potential source degradation in speaker recognition algorithms. As such, particularly suited algorithms use only information. The distortion consists memoryless polynomial sandwiched between two finite-length linear filters. Minimization mean-squared error, with respect parameters, relies on iterative estimation via gradient descent technique, using Jacobian correction term gradients calculated finite-element approximation. Initial work has demonstrated algorithm's usefulness reducing mismatch high- low-quality conditions.

参考文章(5)
Douglas A. Reynolds, Comparison of background normalization methods for text-independent speaker verification. conference of the international speech communication association. ,(1997)
D.A. Reynolds, M.A. Zissman, T.F. Quatieri, G.C. O'Leary, B.A. Carlson, The effects of telephone transmission degradations on speaker recognition performance international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 329- 332 ,(1995) , 10.1109/ICASSP.1995.479540
Douglas A. Reynolds, Speaker identification and verification using Gaussian mixture speaker models Speech Communication. ,vol. 17, pp. 91- 108 ,(1995) , 10.1016/0167-6393(95)00009-D
C.R. Jankowski, T.F. Quatieri, D.A. Reynolds, Measuring fine structure in speech: application to speaker identification international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 325- 328 ,(1995) , 10.1109/ICASSP.1995.479539
D.A. Reynolds, HTIMIT and LLHDB: speech corpora for the study of handset transducer effects international conference on acoustics, speech, and signal processing. ,vol. 2, pp. 1535- 1538 ,(1997) , 10.1109/ICASSP.1997.596243