Bio-inspired broad-class phonetic labelling

作者: Pedro Gómez Vilda , Rafael Martínez Olalla , M. Victoria Rodellar Biarge , L.M. Fernández , Agustin Álvarez Marquina

DOI:

关键词:

摘要: Recent studies have shown that the correct labeling of phonetic classes may help current Automatic Speech Recognition (ASR) when combined with classical parsing automata based on Hidden Markov Models (HMM).Through present paper a method for Phonetic Class Labeling (PCL) bio-inspired speech processing is described. The methodology in automatic detection formants and formant trajectories after careful separation vocal glottal components operation CF (Characteristic Frequency) neurons cochlear nucleus cortical complex human auditory apparatus. Examples class are given applicability to Processing discussed.

参考文章(18)
Guillaume Gravier, Francois Yvon, Bruno Jacob, Frédéric Bimbot, Introducing Contextual Transcription Rules in Large Vocabulary Speech Recognition Text, Speech and Language Technology. pp. 87- 106 ,(2005) , 10.1007/1-4020-2637-4_6
Pingbo Yin, Ling Ma, Mounya Elhilali, Jonathan Fritz, Shihab Shamma, Primary Auditory Cortical Responses while Attending to Different Streams Springer, Berlin, Heidelberg. pp. 257- 265 ,(2007) , 10.1007/978-3-540-73009-5_28
Pedro Gómez-Vilda, José Manuel Ferrández-Vicente, Victoria Rodellar-Biarge, Agustín Álvarez-Marquina, Luis Miguel Mazaira-Fernández, None, A Bio-inspired Architecture for Cognitive Audio international work conference on the interplay between natural and artificial computation. pp. 132- 142 ,(2007) , 10.1007/978-3-540-73053-8_14
P. Gomez, J.I. Godino, A. Alvarez, R. Martinez, V. Nieto, V. Rodellar, Evidence of glottal source spectral features found in vocal fold dynamics international conference on acoustics, speech, and signal processing. ,vol. 5, pp. 441- 444 ,(2005) , 10.1109/ICASSP.2005.1416335
Pierre C. Delattre, Alvin M. Liberman, Franklin S. Cooper, Acoustic Loci and Transitional Cues for Consonants The Journal of the Acoustical Society of America. ,vol. 27, pp. 769- 773 ,(1955) , 10.1121/1.1908024
J. Rauschecker, B Tian, M Hauser, Processing of complex sounds in the Macaque nonprimary auditory cortex Science. ,vol. 268, pp. 111- 114 ,(1995) , 10.1126/SCIENCE.7701330
Mikko Sams, Riitta Salmelin, Evidence of sharp frequency tuning in the human auditory cortex Hearing Research. ,vol. 75, pp. 67- 74 ,(1994) , 10.1016/0378-5955(94)90057-4
Nobuo Suga, Cortical computational maps for auditory imaging Neural Networks. ,vol. 3, pp. 3- 21 ,(1990) , 10.1016/0893-6080(90)90043-K
John F. Culling, C. J. Darwin, Perceptual separation of simultaneous vowels: Within and across‐formant grouping by F0 Journal of the Acoustical Society of America. ,vol. 93, pp. 3454- 3467 ,(1993) , 10.1121/1.405675