Coarse classification using a hierarchical decision tree and top down parsing

作者: L. Wilcox , B. Lowerre

DOI: 10.1109/ICASSP.1986.1169113

关键词:

摘要: In this paper, we describe a robust technique for segmenting an utterance into sequence of coarse phonetic classes. The resulting class string is used to provide contextual information further analysis, and in lexical access limit the number word candidates. Each 10 ms interval first given probability belonging each five classes: silence, vowel, nasal-like, strong fricative weak fricative. probabilities are assigned using hierarchical classification scheme with Gaussian classifiers at node. A fuzzy C-means clustering procedure learn means variances from unlabeled data. Dynamic programming align all possible strings lexicon. performance classifier has been evaluated on TI speaker independent isolated digits correct hypothesized by more than 99 percent time.

参考文章(13)
F. Chen, Lexical access and verification in a broad phonetic approach to continuous digit recognition international conference on acoustics, speech, and signal processing. ,vol. 11, pp. 1089- 1092 ,(1986) , 10.1109/ICASSP.1986.1168961
J. Mari, J. Haton, Some experiments in automatic recognition of a thousand word vocabulary international conference on acoustics, speech, and signal processing. ,vol. 9, pp. 397- 400 ,(1984) , 10.1109/ICASSP.1984.1172526
D. Huttenlocher, V. Zue, A model of lexical access from partial phonetic information international conference on acoustics, speech, and signal processing. ,vol. 9, pp. 391- 394 ,(1984) , 10.1109/ICASSP.1984.1172525
K. Shirai, T. Kobayashi, Phrase speech recognition of large vocabulary using feature in articulatory domain international conference on acoustics, speech, and signal processing. ,vol. 9, pp. 409- 412 ,(1984) , 10.1109/ICASSP.1984.1172584
Renato De Mori, Pietro Laface, Yu Mong, Parallel Algorithms for Syllable Recognition in Continuous Speech IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. PAMI-7, pp. 56- 69 ,(1985) , 10.1109/TPAMI.1985.4767618
J.C. Bezdek, J.C. Dunn, Optimal Fuzzy Partitions: A Heuristic for Estimating the Parameters in a Mixture of Normal Distributions IEEE Transactions on Computers. ,vol. 24, pp. 835- 838 ,(1975) , 10.1109/T-C.1975.224317
H. Lagger, A. Waibel, A coarse phonetic knowledge source for template independent large vocabulary word recognition ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing. ,vol. 10, pp. 862- 865 ,(1985) , 10.1109/ICASSP.1985.1168314
D. Shipman, V. Zue, Properties of large lexicons: Implications for advanced isolated word recognition systems international conference on acoustics, speech, and signal processing. ,vol. 7, pp. 546- 549 ,(1982) , 10.1109/ICASSP.1982.1171902
S. Makino, K. Kido, A speaker independent word recognition system based on phoneme recognition for a large size (212 words) vocabulary international conference on acoustics, speech, and signal processing. ,vol. 9, pp. 29- 32 ,(1984) , 10.1109/ICASSP.1984.1172568
Hong Leung, V. Zue, A procedure for automatic alignment of phonetic transcriptions with continuous speech international conference on acoustics, speech, and signal processing. ,vol. 9, pp. 73- 76 ,(1984) , 10.1109/ICASSP.1984.1172426