作者: Raissa B. Rocha , Marcelo S. Alencar , Virginio V. Freire
DOI: 10.5281/ZENODO.44106
关键词: Acoustic model 、 Speech synthesis 、 Voice analysis 、 Voice activity detection 、 Encoder 、 Speech analytics 、 Hidden Markov model 、 Segmentation 、 Speech recognition 、 Speech segmentation 、 Speech processing 、 Speech corpus 、 Computer science 、 Linear predictive coding
摘要: Voice segmentation is used in speech recognition and system synthesis, as well phonetic voice encoders. This paper describes an implicit system, which aims to estimate the boundaries between phonemes a locution. To find marks, proposed method initially locates reference borders silent periods phonemes, vice versa measuring energy short duration periods. The are found by means of encoding region delimited were detected. evaluate performance objective evaluation using 50 locutions was performed. detected 72.41% which, 77.6% with error less or equal 10 ms 22.4% 20 ms.