Automatic speech segmentation to improve speech synthesis performance

作者: B. Sudhakar , R. Bens Raj

DOI: 10.1109/ICCPCT.2013.6528953

关键词:

摘要: Emerging growth of information and communication technologies has influenced the research trends to focus on speech technologies. Pre-processing signal serves various purposes in any processing application. It includes noise removal, endpoint detection, pre-emphasis, framing, windowing, echo canceling etc. Out these, automatic word/sentence boundary detection is fundamental step for applications like recognition synthesis. This paper expose problem words sentences silent noisy situations. study proposes an algorithm segmentation Indian languages voiced speech. A modified data based scheme finding entropy placed better performance. The method good performance features than energy-based methods. To determine candidates segments, adaptive threshold used which are related sentences. Simulation results that this will provide energy algorithms.

参考文章(7)
John G. Proakis, John R. Deller, John H. Hansen, Discrete-Time Processing of Speech Signals ,(1993)
H.R. Pfitzinger, S. Burger, S. Heid, Syllable detection in read and spontaneous speech international conference on spoken language processing. ,vol. 2, pp. 1261- 1264 ,(1996) , 10.1109/ICSLP.1996.607838
N. Jittiwarangkul, S. Jitapunkul, S. Luksaneeyanavin, V. Ahkuputra, C. Wutiwiwatchai, Thai syllable segmentation for connected speech based on energy asia pacific conference on circuits and systems. pp. 169- 172 ,(1998) , 10.1109/APCCAS.1998.743703
Paul Mermelstein, Automatic segmentation of speech into syllabic units Journal of the Acoustical Society of America. ,vol. 58, pp. 880- 883 ,(1975) , 10.1121/1.380738
L. Lamel, L. Rabiner, A. Rosenberg, J. Wilpon, An improved endpoint detector for isolated word recognition IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 29, pp. 777- 785 ,(1981) , 10.1109/TASSP.1981.1163642
A. Ganapathiraju, L. Webster, J. Trimble, K. Bush, P. Kornman, Comparison of energy-based endpoint detectors for speech signal processing southeastcon. pp. 500- 503 ,(1996) , 10.1109/SECON.1996.510121
J.-C. Junqua, B. Mak, B. Reaves, A robust algorithm for word boundary detection in the presence of noise IEEE Transactions on Speech and Audio Processing. ,vol. 2, pp. 406- 412 ,(1994) , 10.1109/89.294354