A Maximum Likelihood Prosody Recognizer

Authors: Mark Hasegawa-Johnson, Ken Chen, Aaron Cohen

Abstract: Automatic prosody recognition (APR) is of fundamental importance for automatic speech understanding. In this paper, we propose a maximum likelihood prosody recognizer consisting of a GMM-based acoustic model that models the distribution of the phone-level acoustic-prosodic observations (pitch, duration and energy) and an ANN-based language model that models the word-level stochastic dependence between prosody and syntax. Our experiments on the Radio News Corpus show that our recognizer is able to achieve 84% pitch accent recognition accuracy and 93% intonational phrase boundary (IPB) recognition accuracy in the leave-one-speaker-out task, which exceeds previously reported results on the same corpus. The same recognizer was tested on a subset of Switchboard; the accuracies are degraded but still significantly better than chance levels.
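The abstract's combination of an acoustic likelihood with a syntax-conditioned label probability can be illustrated with a toy sketch. This is not the authors' implementation: the function names, the one-dimensional features, the independence assumption across observations, the fixed `label_prior` table standing in for the ANN output, and all numeric values are hypothetical and chosen only for illustration.

```python
import math


def gmm_log_likelihood(x, components):
    """Log-density of a scalar x under a Gaussian mixture.

    components: list of (weight, mean, variance) tuples (hypothetical values).
    """
    terms = [
        math.log(w) - 0.5 * math.log(2 * math.pi * var) - (x - mu) ** 2 / (2 * var)
        for w, mu, var in components
    ]
    # Log-sum-exp for numerical stability.
    m = max(terms)
    return m + math.log(sum(math.exp(t - m) for t in terms))


def classify_word(observations, acoustic_models, label_prior):
    """Return the label maximizing log p(obs | label) + log P(label | syntax).

    observations: scalar acoustic-prosodic features, assumed independent here.
    label_prior: stand-in for the language model's output for this word.
    """
    scores = {}
    for label, components in acoustic_models.items():
        acoustic = sum(gmm_log_likelihood(x, components) for x in observations)
        scores[label] = acoustic + math.log(label_prior[label])
    best = max(scores, key=scores.get)
    return best, scores


# Toy single-component "mixtures" per label; values are made up.
ACOUSTIC_MODELS = {
    "accented": [(1.0, 1.0, 1.0)],
    "unaccented": [(1.0, -1.0, 1.0)],
}

label, _ = classify_word(
    [0.8, 1.2], ACOUSTIC_MODELS, {"accented": 0.5, "unaccented": 0.5}
)
print(label)
```

With a flat prior the acoustic evidence decides the label; a strongly skewed prior can override weak acoustic evidence, which is the role the language model plays in this kind of combined scoring.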


