Author: R. Haeb-Umbach
DOI: 10.1109/ICASSP.1999.758146
Keywords:
Abstract: We apply Fisher variate analysis to measure the effectiveness of speaker normalization techniques. A trace criterion, which measures the ratio of the variations due to different phonemes compared to the variations due to different speakers, serves as a first assessment of a feature set without the need for recognition experiments. By using this measure and by recognition experiments, we demonstrate that cepstral mean normalization also has a speaker normalization effect, in addition to the well-known channel normalization effect. Similarly, vocal tract normalization (VTN) is shown to remove inter-speaker variability. For VTN we show that normalization on a per-sentence basis performs better than normalization on a per-speaker basis. Recognition results are given on the Wall Street Journal and Hub-4 databases.
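
The abstract does not state the exact form of the trace criterion, so the following is only a minimal sketch of one plausible variant: the trace of the inverse speaker scatter matrix times the phoneme scatter matrix, computed on toy data. The function names (`between_class_scatter`, `trace_criterion`, `make_features`), the choice of between-class scatter matrices, and the synthetic feature values are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def between_class_scatter(x, labels):
    """Scatter of the class means around the global mean, weighted by class size."""
    x = np.asarray(x, dtype=float)
    labels = np.asarray(labels)
    global_mean = x.mean(axis=0)
    scatter = np.zeros((x.shape[1], x.shape[1]))
    for c in np.unique(labels):
        members = x[labels == c]
        d = (members.mean(axis=0) - global_mean)[:, None]
        scatter += len(members) * (d @ d.T)
    return scatter / len(x)


def trace_criterion(x, phonemes, speakers):
    """Assumed criterion: trace(S_speaker^{-1} S_phoneme); larger means
    phoneme-induced variation dominates speaker-induced variation."""
    s_phoneme = between_class_scatter(x, phonemes)
    s_speaker = between_class_scatter(x, speakers)
    # Requires a non-singular speaker scatter matrix.
    return float(np.trace(np.linalg.solve(s_speaker, s_phoneme)))


# Hypothetical 2-D toy features: 3 phoneme means and 4 speaker offsets.
PHONEME_MEANS = np.array([[3.0, 0.0], [-3.0, 1.0], [0.0, -2.0]])
SPEAKER_OFFSETS = np.array([[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0]])


def make_features(speaker_scale):
    """One frame per (speaker, phoneme) pair; speaker_scale controls how
    strongly the speaker identity shifts the features."""
    rows, phonemes, speakers = [], [], []
    for s, offset in enumerate(SPEAKER_OFFSETS):
        for p, mean in enumerate(PHONEME_MEANS):
            rows.append(mean + speaker_scale * offset)
            phonemes.append(p)
            speakers.append(s)
    return np.array(rows), np.array(phonemes), np.array(speakers)


# Strong speaker variation versus a feature set with reduced speaker variation.
strong = trace_criterion(*make_features(speaker_scale=2.0))
weak = trace_criterion(*make_features(speaker_scale=0.5))
print(f"criterion with strong speaker variation: {strong:.3f}")
print(f"criterion with weak speaker variation:   {weak:.3f}")
```

In this toy setting, shrinking the speaker offsets raises the criterion, which is the kind of feature-set comparison the abstract describes doing before any recognition experiment is run.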