作者: Oded Ghitza
关键词: Computer science 、 Motor theory of speech perception 、 Word error rate 、 Intelligibility (communication) 、 Speech recognition 、 Decoding methods 、 Speech processing 、 Speech perception 、 Word recognition 、 Neurocomputational speech processing
摘要: The premise of this study is that current models speech perception, which are driven by acoustic features alone, incomplete, and the role decoding time during memory access must be incorporated to account for patterns observed recognition phenomena. It postulated governed a cascade neuronal oscillators, guide template-matching operations at hierarchy temporal scales. Cascaded cortical oscillations in theta, beta gamma frequency bands argued crucial intelligibility. Intelligibility high so long as these remain phase-locked auditory input rhythm. A model (Tempo) presented capable emulating recent psychophysical data on intelligibility sentences function “packaging” rate (Ghitza Greenberg, 2009). show time-compressed factor 3 (i.e., syllabic rate) poor (above 50% word error rate), but substantially restored when information stream re-packaged insertion silence gaps between successive compressed-signal intervals – counterintuitive finding, difficult explain using classical emerging naturally from Tempo architecture.