A new voice transformation method based on both linear and nonlinear prediction analysis

作者: Ki Seung Lee , Dae Hee Youn , Il Whan Cha

DOI: 10.1109/ICSLP.1996.607876

关键词:

摘要: We describe a voice transformation method which changes the source speaker's acoustic features to those of target speaker. In are divided into two parts, linear and nonlinear parts. Linear parts characterized by LPC cepstrum coefficients obtained from LP analysis. The part, represents excitation signal, is modelled long-delay predictor using neural net. Conversion rules for signal generated average pitch ratio mapping codebook, based on orthogonal vector space conversion. addition, spectral envelope compensation proposed correct distortion. transformed speech listening test shows that makes it possible convert individuality while maintaining high quality.

参考文章(10)
Dae Hee Youn, Il-Whan Cha, Ki-Seung Lee, Voice personality transformation using an orthogonal vector space conversion. conference of the international speech communication association. ,(1995)
F. Diaz-de-Maria, A.R. Figueiras-Vidal, Nonlinear prediction for speech coding using radial basis functions international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 788- 791 ,(1995) , 10.1109/ICASSP.1995.479812
H. Valbret, E. Moulines, J.P. Tubach, Voice transformation using PSOLA technique conference of the international speech communication association. ,vol. 11, pp. 175- 187 ,(1992) , 10.1016/0167-6393(92)90012-V
Michael Savic, Il-Hyun Nam, Voice personality transformation Digital Signal Processing. ,vol. 1, pp. 107- 110 ,(1991) , 10.1016/1051-2004(91)90099-7
D. Childers, B. Yegnanarayana, Ke Wu, Voice conversion: Factors responsible for quality international conference on acoustics, speech, and signal processing. ,vol. 10, pp. 748- 751 ,(1985) , 10.1109/ICASSP.1985.1168479
J. Thyssen, H. Nielsen, S.D. Hansen, Non-linear short-term prediction in speech coding international conference on acoustics, speech, and signal processing. pp. 185- 188 ,(1994) , 10.1109/ICASSP.1994.389324
L. Wu, M. Niranjan, F. Fallside, Nonlinear predictive vector quantisation with recurrent neural nets Neural Networks for Signal Processing III - Proceedings of the 1993 IEEE-SP Workshop. pp. 372- 381 ,(1993) , 10.1109/NNSP.1993.471851
M. Abe, S. Nakamura, K. Shikano, H. Kuwabara, Voice conversion through vector quantization international conference on acoustics speech and signal processing. pp. 655- 658 ,(1988) , 10.1109/ICASSP.1988.196671
Y. Linde, A. Buzo, R. Gray, An Algorithm for Vector Quantizer Design IEEE Transactions on Communications. ,vol. 28, pp. 84- 95 ,(1980) , 10.1109/TCOM.1980.1094577
Enzo Mumolo, A. Carini, Francescato Diego, ADPCM with non linear predictors european signal processing conference. pp. 387- 390 ,(1994)