Authors: Christian Plahl, Fabio Valente, Mathew Magimai.-Doss, Ravuri Suman
DOI:
Keywords: Speech recognition, Reduction (complexity), Critical band, Data set, Mandarin Chinese, Energy (signal processing), Computer science, Modulation spectrum
Abstract: This paper investigates the use of TANDEM features based on hierarchical processing of the modulation spectrum. The study is done in the framework of the GALE project for the recognition of Mandarin Broadcast data. We describe the improvements obtained using the hierarchical processing and the addition of features like pitch and short-term critical band energy. Results are consistent with previous findings on a different LVCSR task, suggesting that the proposed technique is effective and robust across several conditions. Furthermore, we describe the integration into the RWTH system trained on 1600 hours of data and present the progress across the 2007 and 2008 systems, resulting in an approximately 20% CER reduction on the data set.