Objective Distance Measures for Spectral Discontinuities in Concatenative Speech Synthesis

Authors:

DOI: 10.1109/WSS.2002.1224414

Keywords:

Abstract: The quality of unit-selection-based concatenative speech synthesis mainly depends on how well two successive units can be joined together to minimise audible discontinuities. The objective measure of discontinuity used when selecting units is known as the join cost. The ideal join cost will measure perceived discontinuity, based on easily measurable spectral properties of the units being joined, in order to ensure smooth and natural-sounding synthetic speech. In this paper we describe a perceptual experiment conducted to measure the correlation between subjective human perception and various objective spectrally-based measures proposed in the literature. We also report new objective distance measures, derived from various distance metrics based on these features, which have good correlation with human perception of concatenation discontinuities. Our experiments used a state-of-the-art unit-selection text-to-speech system: rVoice from Rhetorical Systems Ltd.
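As a rough illustration of the idea in the abstract, the sketch below scores a join by comparing the spectral frame at the end of one unit with the frame at the start of the next, then checks how well such scores track listener ratings. Everything here is an assumption made for illustration: the function names are hypothetical, the two metrics (Euclidean distance and a symmetric Kullback-Leibler divergence) are generic choices rather than the specific measures evaluated in the paper, and any input vectors are toy values, not experimental data.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length spectral feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def symmetric_kl(p, q, eps=1e-12):
    """Symmetric Kullback-Leibler divergence, KL(p||q) + KL(q||p).

    The inputs are non-negative spectral envelopes; each is normalised to
    sum to 1, and eps is added to avoid taking the log of zero.
    """
    ps, qs = sum(p), sum(q)
    pn = [x / ps + eps for x in p]
    qn = [x / qs + eps for x in q]
    return sum((pi - qi) * math.log(pi / qi) for pi, qi in zip(pn, qn))

def pearson(xs, ys):
    """Pearson correlation between objective distances and listener scores."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)

def join_cost(left_frame, right_frame, metric=euclidean):
    """Hypothetical join cost: distance between the frames on either side of a join."""
    return metric(left_frame, right_frame)
```

A measure would be judged useful in this setting if `pearson` applied to its join costs and the corresponding perceptual discontinuity ratings gave a high correlation.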
