A frequency domain waveform speech compression system based on product vector quantizers

作者: Ning He , A. Buzo , F. Kuhlmann

DOI: 10.1109/ICASSP.1986.1168595

关键词: Linear predictive codingMathematicsFourier transformSignal processingFrequency domainSpeech recognitionSpeech perceptionSpeech codingWaveformVector quantization

摘要: The discrete short-time Fourier transform (DS TFT) has been widely used to study and analyze several speech signal characteristics. However, this Scheme not as successfully in compression based on scalar quantization. On the other hand, most perception concepts have very interesting frequency-domain interpretations, which suggest design of schemes frequency domain analysis. In paper we apply vector quantization techniques for designing simulating systems. main conclusion can draw is that a low medium rate compressor, reproduction at least communications quality, complexity moderate, principle possible.

参考文章(8)
H. Abut, R. Gray, G. Rebolledo, Vector quantization of speech and speech-like waveforms IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 30, pp. 423- 435 ,(1982) , 10.1109/TASSP.1982.1163907
J. Makhoul, S. Roucos, H. Gish, Vector quantization in speech coding Proceedings of the IEEE. ,vol. 73, pp. 1551- 1588 ,(1985) , 10.1109/PROC.1985.13340
M. Sabin, R. Gray, Product code vector quantizers for waveform and voice coding IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 32, pp. 474- 488 ,(1984) , 10.1109/TASSP.1984.1164346
V. Cuperman, A. Gersho, Vector Predictive Coding of Speech at 16 kbits/s IEEE Transactions on Communications. ,vol. 33, pp. 685- 696 ,(1985) , 10.1109/TCOM.1985.1096372
J. Tribolet, R. Crochiere, Frequency domain coding of speech IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 27, pp. 512- 530 ,(1979) , 10.1109/TASSP.1979.1163283
A. Buzo, A. Gray, R. Gray, J. Markel, Speech coding based upon vector quantization IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 28, pp. 562- 574 ,(1980) , 10.1109/TASSP.1980.1163445
J. Tribolet, R. Crochiere, J. Flanagan, N. Jayant, B. Atal, M. Schroeder, Speech Coding IEEE Transactions on Communications. ,(1979)
R. Gray, Vector quantization IEEE Assp Magazine. pp. 75- 100 ,(1984)