作者: Sin-Horng Chen , Yih-Ru Wang , Chen-Yu Chiang , Yuan-Fu Liao , Ming-Chieh Liu
DOI:
关键词: Word (computer architecture) 、 Artificial intelligence 、 State model 、 Structure (mathematical logic) 、 Speech recognition 、 Segmentation 、 Factored language model 、 Natural language processing 、 Computer science 、 Syllable 、 SIGNAL (programming language) 、 Speech synthesis
摘要: A Chinese speech recognition system and method is disclosed. Firstly, a signal received recognized to output word lattice. Next, the lattice received, arcs of are rescored reranked with prosodic break model, state syllable prosodic-acoustic syllable-juncture model factored language so as tag, tag phonetic segmentation which correspond signal. The present invention performs rescoring in two-stage way promote rate basic information labels provide structure for rear-stage voice conversion synthesis.