System and method for training cloned tone and rhythm based on Bottleneck features

作者： Sima Huapeng , Gong Xuefei

DOI:

关键词: Speech recognition 、 Training system 、 Scheme (programming language) 、 Field (computer science) 、 Speech synthesis 、 Computer science 、 Transfer of learning 、 Bottleneck 、 Tone (musical instrument) 、 Feature extraction

摘要: The invention relates to the technical field of voice synthesis, recognition and cloning, provides a cloning implementation scheme based on Bottleneck features (language featuresof audio) by combining synthesis technology, technology transfer learning technology. A training system method are included. TTS service with highnaturalness similarity is provided using small number samples, so that target user characteristics provided, problems large sample size, long manufacturing period high labor cost solved. comprises data acquisition module, an acoustic feature extraction rhythm multi-person module module. further system. steps oftraining corpus preparation, extraction, fine adjustment all modules speech synthesis.

lens.org UNKNOWN 下载加速

参考文章(11)

Masahiro Morita, Takehiko Kagoshima, Speech synthesis apparatus and method ,(2007)

Asaf Rendel, Raul Fernandez, Hybrid predictive model for enhancing prosodic expressiveness ,(2013)

Crystal Annette Nakatsu, Jessica M Christian, Ángel Rodriguez, Pilar Amores, Amores Carredano José Gabriel De, Robert James Firby, Den Berg Martin Henk Van, Nonlinguistic input for natural language generation ,(2015)

Chen Mengzhe, Zhang Qingqing, Yan Yonghong, Pan Jielin, Neural network acoustic model training method ,(2017)

Xu Yangkai, Li Xiulin, Training method and device for prosody model used for speech synthesis ,(2015)

Rana el Kaliouby, George Alexander Reichenbach, Taniya Mishra, Avatar image animation using translation vectors ,(2018)

Lenchner Jonathan, Guo Shang Qing, Initiating synthesized speech outpout from a voice-controlled device ,(2019)

Li Xuehui, Chen Zhuo, Rong Bojie, Chen Xi, Lan Zhijian, Yu Chunxia, Method for identifying sound fault based on mel energy spectrum and convolution neural network ,(2019)

Wan Li, Pan Chenghua, Li Canhong, Noisy speech recognition method based on transfer learning ,(2019)

10.

Xu Bo, Method and device for training voice synthesis model, electronic equipment and storage medium ,(2019)

System and method for training cloned tone and rhythm based on Bottleneck features

来源期刊

我的账户

System and method for training cloned tone and rhythm based on Bottleneck features

来源期刊

相似文章 1

Voice synthesis method and device based on tone cloning and electronic equipment

我的账户