Web Services Based Hybrid Recognizer of Lithuanian Voice Commands

作者: V. Rudzionis , G. Raskinis , R. Maskeliunas , A. Rudzionis , K. Ratkevicius

DOI: 10.5755/J01.EEE.20.9.8713

关键词:

摘要: This paper presents the recently developed medical-pharmaceutical informative system with voice user interface. is first computerized oriented towards healthcare services and industry where Lithuanian commands are used as a primary mean for control. Another essential property of its hybrid nature: two different recognizers - an adapted commercial Spanish speech recognizer available from Microsoft locally HMM based on acoustic models – operating in parallel. The recognition hypotheses produced by those joined together using logical rules obtained decision induction algorithms such Ripper. All these measures approaches allowed achieve very high speaker independent accuracy acceptable implementation practice. best achieved was 98.9 % 1000 commands. optimization issues related development system. DOI: http://dx.doi.org/10.5755/j01.eee.20.9.8713

参考文章(12)
Li Deng, Front-End, Back-End, and Hybrid Techniques for Noise-Robust Speech Recognition Robust Speech Recognition of Uncertain or Missing Data. pp. 67- 99 ,(2011) , 10.1007/978-3-642-21317-5_4
Vytautas Rudžionis, Kastytis Ratkevičius, Algimantas Rudžionis, Gailius Raškinis, Rytis Maskeliunas, Recognition of Voice Commands Using Hybrid Approach international conference on information and software technologies. pp. 249- 260 ,(2013) , 10.1007/978-3-642-41947-8_21
Rytis Maskeliunas, Algimantas Rudzionis, Vytautas Rudzionis, Advances on the use of the foreign language recognizer COST'09 Proceedings of the Second international conference on Development of Multimodal Interfaces: active Listening and Synchrony. pp. 217- 224 ,(2009) , 10.1007/978-3-642-12397-9_18
William W. Cohen, Fast Effective Rule Induction Machine Learning Proceedings 1995. pp. 115- 123 ,(1995) , 10.1016/B978-1-55860-377-6.50023-2
T. Sledevic, G. Tamulevicius, D. Navakauskas, Upgrading FPGA Implementation of Isolated Word Recognition System for a Real-Time Operation Elektronika Ir Elektrotechnika. ,vol. 19, pp. 123- 128 ,(2013) , 10.5755/J01.EEE.19.10.5907
Jui-Ting Huang, Jinyu Li, Dong Yu, Li Deng, Yifan Gong, Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers international conference on acoustics, speech, and signal processing. pp. 7304- 7308 ,(2013) , 10.1109/ICASSP.2013.6639081
V. Rudzionis, G. Raskinis, G. Raskinis, R. Maskeliunas, R. Maskeliunas, A. Rudzionis, A. Rudzionis, Kastytis Ratkevicius, Kastytis Ratkevicius, Comparative Analysis of Adapted Foreign Language and Native Lithuanian Speech Recognizers for Voice User Interface Elektronika Ir Elektrotechnika. ,vol. 19, pp. 90- 93 ,(2013) , 10.5755/J01.EEE.19.7.5171
Lukas Burget, Petr Schwarz, Mohit Agarwal, Pinar Akyazi, Kai Feng, Arnab Ghoshal, Ondrej Glembek, Nagendra Goel, Martin Karafiat, Daniel Povey, Ariya Rastrow, Richard C. Rose, Samuel Thomas, Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models international conference on acoustics, speech, and signal processing. pp. 4334- 4337 ,(2010) , 10.1109/ICASSP.2010.5495646
L.R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition Proceedings of the IEEE. ,vol. 77, pp. 267- 296 ,(1989) , 10.1109/5.18626
Samuel Thomas, Sriram Ganapathy, Hynek Hermansky, Multilingual MLP features for low-resource LVCSR systems 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). pp. 4269- 4272 ,(2012) , 10.1109/ICASSP.2012.6288862