作者: Stephen A. Zahorian
DOI:
关键词: Hidden Markov model 、 Natural language processing 、 Context (language use) 、 Mandarin Chinese 、 Artificial intelligence 、 Computer science 、 Syllable 、 Mel-frequency cepstrum 、 Speech recognition 、 Feature (machine learning) 、 Tone (musical instrument) 、 Linguistic Data Consortium
摘要: Abstract : This report gives a detailed summary of research work completed under Air Force Research Laboratory (AFRL) grant 56236, over the time period (November 17, 2010 - November 16, 2012). The main objective was to study various methods for Mandarin syllable recognition. Techniques were explored both base recognition and lexical tone RASC863 database, obtained from Chinese Linguistic Data Consortium used experimental work. Basel phone (60 phones) done with Hidden Markov Model recognizer. Best results approximately 69%. Human listeners establish baseline Tone accuracy humans ranges about 55% 90%, depending on how much context is given listeners. best classification neural network classifier 76%. recognizer 71%. In addition ASR experiments Mandarin, basic improved pitch tracking, refinement spectral/temporal features (DCTCs/DCSCs) done. It determined that longer intervals are preferred dynamic feature calculations than typically MFCC features. Also segment somewhat English.