作者: L.J. Lee , H. Attias , Li Deng
DOI: 10.1109/ICASSP.2003.1198920
关键词: Speech enhancement 、 Machine learning 、 Natural language 、 Computer science 、 Inference 、 Bayesian network 、 Speech processing 、 Artificial intelligence 、 Speech production 、 Hidden Markov model 、 State space
摘要: This paper describes novel and powerful variational EM algorithms for the segmental switching state space models used in speech applications, which are capable of capturing key internal (or hidden) dynamics natural production. Hidden dynamic (HDMs) have recently become a class promising acoustic to incorporate crucial speech-specific knowledge overcome many inherent weaknesses traditional HMMs. However, lack efficient statistical learning is one main obstacles preventing them from being well studied widely used. Since exact inference intractable, approach taken develop effective approximate algorithms. We implemented constraint modeling present recovering hidden discrete units data only. The effectiveness developed verified by experiments on simulation Switchboard data.