Distortion measures for speech processing

Authors: R. Gray, A. Buzo, A. Gray, Y. Matsuyama

DOI: 10.1109/TASSP.1980.1163421

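Abstract: Several properties, interrelations, and interpretations are developed for various speech spectral distortion measures. The principal results are 1) the development of notions of relative strength and equivalence of the various distortion measures, both in a mathematical sense corresponding to subjective equivalence and in a coding sense when used in minimum distortion or nearest neighbor speech processing systems; 2) the demonstration that the Itakura-Saito and related distortion measures possess a property similar to the triangle inequality when used in nearest neighbor systems such as quantization and cluster analysis; and 3) the demonstration that the Itakura-Saito and normalized model distortion measures yield efficient computation algorithms for generalized centroids or minimum distortion points of groups or clusters of speech frames, an important computation in both classical cluster analysis techniques and in algorithms for optimal quantizer design. We also argue that the Itakura-Saito and related distortions are well-suited computationally, mathematically, and intuitively for such applications.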

References (29)
J. D. Markel, R. M. Gray, A. H. Gray, Y. Matsuyama, A. Buzo, Source Coding and Speech Compression. ITC/USA/'78; Proceedings of the International Telemetering Conference, pp. 871-878 (1978).
Y. Matsuyama, R. M. Gray, A. Buzo, Spectral Distortion Measures for Speech Compression. Stanford Univ. Report (1978).
S. Levinson, L. Rabiner, A. Rosenberg, J. Wilpon, Interactive clustering techniques for selecting speaker-independent reference templates for isolated word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 27, pp. 134-141 (1979). DOI: 10.1109/TASSP.1979.1163222
Robert M. Gray, David L. Neuhoff, Paul C. Shields, A Generalization of Ornstein's $\bar d$ Distance with Applications to Information Theory. Annals of Probability, vol. 3, pp. 315-328 (1975). DOI: 10.1214/AOP/1176996402
Robert M. Gray, John C. Kieffer, Yoseph Linde, Locally optimal block quantizer design. Information and Control, vol. 45, pp. 178-198 (1980). DOI: 10.1016/S0019-9958(80)90313-7
R. Viswanathan, J. Makhoul, Quantization properties of transmission parameters in linear predictive systems. IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 23, pp. 309-321 (1975). DOI: 10.1109/TASSP.1975.1162675
A. Gray, J. Markel, Distance measures for speech processing. IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 24, pp. 380-391 (1976). DOI: 10.1109/TASSP.1976.1162849
Y. Matsuyama, R. Gray, Universal tree encoding for speech. IEEE Transactions on Information Theory, vol. 27, pp. 31-40 (1981). DOI: 10.1109/TIT.1981.1056306