作者: Dung T. Tran , Emmanuel Vincent , Denis Jouvet
DOI: 10.1109/ICASSP.2014.6854656
关键词:
摘要: Uncertainty propagation has been successfully employed for speech recognition in nonstationary noise environments. The uncertainty about the features is typically represented as a diagonal covariance matrix static only. We present framework estimating over both and dynamic full matrix. estimated then multiplied by scaling coefficients optimized on development data. achieve 21% relative error rate reduction 2nd CHiME Challenge with respect to conventional decoding without uncertainty, that five times more than achieved