作者: Disha Kaur Phull , G Bharadwaja Kumar
DOI: 10.1007/S12046-018-0976-X
关键词: Perplexity 、 Word error rate 、 Transcription (software) 、 Language model 、 Speech recognition 、 Sphinx 、 Indian English 、 Out of vocabulary 、 Computer science 、 Language modelling
摘要: A great amount of research is growing towards the automatic transcription lectures that consist numerous information and knowledge could be helpful to educational systems institutes. In large vocabulary speech recognition, language model plays a paramount role in reducing humongous search space. However, modelling very brittle when moving from one domain another or read spontaneous speech. Also, lecture recognition will have some characteristics Hence, it challenging build for this task. paper, judicious approach adapt way where close proximity topic spoken has been depicted. The evaluation devised using proposed with existing models such as CMU Sphinx, Gigaword HUB-4. We observed results analysis outperform terms word error rate, perplexity out rate. Analysis shows presented two-phase resulted an average decrease rate approximately 14% decreased by half on average.