Learning to adapt: a meta-learning approach for speaker adaptation

作者: Peter Bell , Ondřej Klejch , Joachim Fainberg

DOI:

关键词:

摘要: The performance of automatic speech recognition systems can be improved by adapting an acoustic model to compensate for the mismatch between training and testing conditions, for …

参考文章(19)
Diederik P. Kingma, Jimmy Ba, Adam: A Method for Stochastic Optimization arXiv: Learning. ,(2014)
Yong Zhao, Jinyu Li, Jian Xue, Yifan Gong, Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data international conference on acoustics, speech, and signal processing. pp. 4310- 4314 ,(2015) , 10.1109/ICASSP.2015.7178784
I-Fan Chen, Chin-Hui Lee, Zhen Huang, Sabato Marco Siniscalchi, Jiadong Wu, Maximum a Posteriori Adaptation of Network Parameters in Deep Models arXiv: Learning. ,(2015)
Dong Yu, Kaisheng Yao, Hang Su, Gang Li, Frank Seide, KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition international conference on acoustics, speech, and signal processing. pp. 7893- 7897 ,(2013) , 10.1109/ICASSP.2013.6639201
Ossama Abdel-Hamid, Hui Jiang, Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code international conference on acoustics, speech, and signal processing. pp. 7942- 7946 ,(2013) , 10.1109/ICASSP.2013.6639211
M.J.F. Gales, Maximum likelihood linear transformations for HMM-based speech recognition Computer Speech & Language. ,vol. 12, pp. 75- 98 ,(1998) , 10.1006/CSLA.1998.0043
Hank Liao, Speaker adaptation of context dependent deep neural networks international conference on acoustics, speech, and signal processing. pp. 7947- 7951 ,(2013) , 10.1109/ICASSP.2013.6639212
Jian Xue, Jinyu Li, Dong Yu, Mike Seltzer, Yifan Gong, Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network international conference on acoustics, speech, and signal processing. pp. 6359- 6363 ,(2014) , 10.1109/ICASSP.2014.6854828
Roberto Gemello, Franco Mana, Stefano Scanzio, Pietro Laface, Renato De Mori, Linear hidden transformations for adaptation of hybrid ANN/HMM models Speech Communication. ,vol. 49, pp. 827- 835 ,(2007) , 10.1016/J.SPECOM.2006.11.005
Kaisheng Yao, Dong Yu, Frank Seide, Hang Su, Li Deng, Yifan Gong, Adaptation of context-dependent deep neural networks for automatic speech recognition spoken language technology workshop. pp. 366- 369 ,(2012) , 10.1109/SLT.2012.6424251