作者: S. Harish , P. Vijayalakshmi , T. Nagarajan
DOI: 10.1109/ICECTECH.2011.5941739
关键词: Audio mining 、 Acoustic model 、 Speech corpus 、 Speech segmentation 、 Speech processing 、 Speech recognition 、 Speech analytics 、 Voice activity detection 、 Artificial intelligence 、 Computer science 、 Natural language processing 、 Speech synthesis
摘要: Over the last few decades speech recognition has evolved and matured enough to be used in commercial applications. The applications include automatic dictation software, voice dialling, controlled navigation simple data entry. Automatic Speech Recognition (ASR) deals with conversion of acoustic signals an utterance into text. In this work system for Tamil language is developed. requires segmentation waveform fundamental units. Word natural unit speech. However, each word trained individually there cannot any sharing parameters among words. Hence, it essential have a very large training set so that all words vocabulary are adequately trained. Also problem memory requirement which grows linearly number preferred overcome constraint phone unit. It less models they well For current work, units such as monophones triphones considered. This highlights importance segmented speech, model co-articulation effect influences production. Triphone considers effect. Monophone triphone based systems developed their performance shows above mentioned parameters.