Exploiting phonological constraints and automatic identification of speaker classes for Arabic speech recognition

作者: Iman Alsharhan

DOI:

关键词:

摘要:

参考文章(102)
Bing Xiang, Kham Nguyen, Long Nguyen, R. Schwartz, J. Makhoul, Morphological Decomposition for Arabic Broadcast News Transcription international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 1089- 1092 ,(2006) , 10.1109/ICASSP.2006.1660214
J.P. Campbell, Speaker recognition: a tutorial Proceedings of the IEEE. ,vol. 85, pp. 1437- 1462 ,(1997) , 10.1109/5.628714
Li Deng, Kuansan Wang, A. Acero, Hsiao-Wuen Hon, J. Droppo, C. Boulis, Ye-Yi Wang, D. Jacoby, M. Mahajan, C. Chelba, X.D. Huang, Distributed speech processing in miPad's multimodal user interface IEEE Transactions on Speech and Audio Processing. ,vol. 10, pp. 605- 619 ,(2002) , 10.1109/TSA.2002.804538
D. Ellis, N. Morgan, Size matters: an empirical study of neural network training for large vocabulary continuous speech recognition international conference on acoustics speech and signal processing. ,vol. 2, pp. 1013- 1016 ,(1999) , 10.1109/ICASSP.1999.759875
Yousef Ajami Alotaibi, Sid-Ahmed Selouani, Douglas O'Shaughnessy, Experiments on automatic recognition of nonnative Arabic speech Eurasip Journal on Audio, Speech, and Music Processing. ,vol. 2008, pp. 679831- ,(2008) , 10.1155/2008/679831
R. Lippmann, E. Martin, D. Paul, Multi-style training for robust isolated-word speech recognition international conference on acoustics, speech, and signal processing. ,vol. 12, pp. 705- 708 ,(1987) , 10.1109/ICASSP.1987.1169544
A. Aull, V. Zue, Lexical stress determination and its application to large vocabulary speech recognition international conference on acoustics, speech, and signal processing. ,vol. 10, pp. 1549- 1552 ,(1985) , 10.1109/ICASSP.1985.1168075
A. Messaoudi, J. Gauvain, L. Lamel, Arabic Broadcast News Transcription Using a One Million Word Vocalized Vocabulary international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 1093- 1096 ,(2006) , 10.1109/ICASSP.2006.1660215
Ahmad Emami, Lidia Mangu, Empirical study of neural network language models for Arabic speech recognition ieee automatic speech recognition and understanding workshop. pp. 147- 152 ,(2007) , 10.1109/ASRU.2007.4430100
J.L. Hieronymus, D. McKelvie, F. McInnes, Use of acoustic sentence level and lexical stress in HSMM speech recognition international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 225- 227 ,(1992) , 10.1109/ICASSP.1992.225931