Authors: Masanobu Abe, Satoshi Nakamura, Kiyohiro Shikano, Hisao Kuwabara
DOI: 10.1250/AST.11.71
Keywords: Speech recognition, Overall performance, Active listening, Vector quantization, Distortion, Computer science, Power (physics), Pitch frequency
Abstract: A new voice conversion technique through vector quantization and spectrum mapping is proposed. This technique is based on mapping codebooks which represent the correspondence between different speakers' codebooks. The mapping codebooks for spectrum parameters, power values, and pitch frequencies are separately generated using training utterances. This technique makes it possible to precisely control voice individuality. The performance of this technique is confirmed by spectrum distortion and pitch frequency difference. To evaluate the overall performance of this technique, listening tests are carried out on two kinds of voice conversions: one between male and female speakers, the other between male speakers. In the male-to-female conversion experiment, all converted utterances are judged as female, and in the male-to-male conversion, 57% of them are identified as the target speaker.
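
The mapping-codebook idea in the abstract can be illustrated with a minimal sketch. It is one plausible reading, not the authors' implementation: the abstract does not say how the correspondence between codebooks is estimated. The sketch assumes that source and target training frames are already time-aligned, that each speaker's codebook is given, and that a mapped vector is the co-occurrence-weighted average of target codewords. The function names `quantize`, `build_mapping_codebook`, and `convert` are hypothetical.

```python
import numpy as np


def quantize(frames, codebook):
    """Return the index of the nearest codeword (Euclidean) for each frame."""
    # distances has shape (n_frames, n_codewords)
    distances = np.linalg.norm(frames[:, None, :] - codebook[None, :, :], axis=2)
    return np.argmin(distances, axis=1)


def build_mapping_codebook(src_idx, tgt_idx, tgt_codebook, n_src):
    """Build one mapped vector per source codeword from aligned index pairs.

    Assumption: src_idx[t] and tgt_idx[t] come from time-aligned frames of
    the same training utterance spoken by the source and target speakers.
    """
    n_tgt = tgt_codebook.shape[0]
    # Count how often source codeword i co-occurs with target codeword j.
    hist = np.zeros((n_src, n_tgt))
    np.add.at(hist, (src_idx, tgt_idx), 1.0)
    totals = hist.sum(axis=1, keepdims=True)
    # Normalise each row to weights; source codewords never seen in
    # training keep an all-zero row (a simplification of this sketch).
    weights = np.divide(hist, totals, out=np.zeros_like(hist), where=totals > 0)
    # Each mapped vector is a weighted average of target codewords.
    return weights @ tgt_codebook


def convert(frames, src_codebook, mapping_codebook):
    """Quantize source frames, then look up the mapped target-side vectors."""
    return mapping_codebook[quantize(frames, src_codebook)]
```

Under the same assumptions, the procedure would be run separately for spectrum parameters, power values, and pitch frequencies, since the abstract states that their mapping codebooks are generated separately.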