Speech recognition from overlapping frequency bands with output data reduction

作者: Olli Viikki , Ramalingam Hariharan , Jilei Tian , Imre Kiss , Juha Häkkinen

DOI:

关键词:

摘要: A speech recognition feature extractor includes a time-to-frequency domain transformer for generating spectral values in the frequency from signal; partitioning means first set and an additional of domain; generator group features using values; generators arranged to operate parallel, assembler assembling output at least one features, anti-aliasing sampling rate reduction block, where comprise common value.

参考文章(8)
Rathinavelu Chengalvarayan, Hierarchial subband linear predictive cepstral features for HMM-based speech recognition Journal of the Acoustical Society of America. ,vol. 111, pp. 1517- 1517 ,(2000) , 10.1121/1.1479011
L. Yapp, G. Zick, Speech recognition on MPEG/Audio encoded files international conference on multimedia computing and systems. pp. 624- 625 ,(1997) , 10.1109/MMCS.1997.609787
S. Tibrewala, H. Hermansky, Sub-band based recognition of noisy speech international conference on acoustics, speech, and signal processing. ,vol. 2, pp. 1255- 1258 ,(1997) , 10.1109/ICASSP.1997.596173
S. Okawa, E. Bocchieri, A. Potamianos, Multi-band speech recognition in noisy environments international conference on acoustics speech and signal processing. ,vol. 2, pp. 641- 644 ,(1998) , 10.1109/ICASSP.1998.675346
Chienchung Chang, Distributed voice recognition system Journal of the Acoustical Society of America. ,vol. 113, pp. 27- 27 ,(1994) , 10.1121/1.1554228
Joseph Gordon Tang, Mark Pawlewski, A method and apparatus for speaker recognition ,(1994)
V.V. Digalakis, L.G. Neumeyer, M. Perakakis, Quantization of cepstral parameters for speech recognition over the World Wide Web IEEE Journal on Selected Areas in Communications. ,vol. 17, pp. 82- 90 ,(1999) , 10.1109/49.743698