作者: G. Nagino , M. Shozakai
DOI: 10.1109/ICASSP.2005.1415147
关键词: Acoustic space 、 Speech recognition 、 Loudspeaker 、 Hidden Markov model 、 Cosmos 、 Data visualization 、 Visualization 、 Space (punctuation) 、 Space technology 、 Computer science 、 Acoustic model
摘要: This paper proposes the technique of building an effective corpus with lower cost by using method visualizing multiple HMM acoustic models into a 2D space ("COSMOS" method: comprehensive map objective signal, previously sound) method. In experiment this paper, adapted 533 male speakers are made small quantity voice samples (10 words) per speaker. Then plotted (called COSMOS map) featuring total is generated utilizing A was built selecting 200 located only in periphery distribution and collecting (165 The model trained from showed higher performance than one other selected randomly or all map.