Authors: Stephen A. Zahorian, Hongbing Hu
DOI: 10.1121/1.2916590
Keywords:
Abstract: In this paper, a fundamental frequency (F0) tracking algorithm is presented that is extremely robust for both high quality and telephone speech, at signal to noise ratios ranging from clean speech to very noisy speech. The algorithm is named "YAAPT," for "yet another algorithm for pitch tracking." The algorithm is based on a combination of time domain processing, using the normalized cross correlation, and frequency domain processing. Major steps include processing of the original acoustic signal and a nonlinearly processed version of the signal, the use of a new method for computing a modified autocorrelation function that incorporates information from multiple spectral harmonic peaks, peak picking to select multiple F0 candidates and associated figures of merit, and extensive use of dynamic programming to find the "best" track among the multiple F0 candidates. The algorithm was evaluated by using three databases and compared to three other published F0 tracking algorithms by using both high quality and telephone speech for various noise conditions. For clean speech, the error rates obtained are comparable to those obtained with the best results reported for any other algorithm; for noisy telephone speech, the error rates obtained are lower than those obtained with other methods.
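The time domain ingredient named in the abstract, the normalized cross correlation, can be illustrated with a minimal sketch. This is not the YAAPT algorithm: it omits the nonlinear processing, the spectral harmonic analysis, and the dynamic programming, and it returns only the single strongest candidate for one frame. The function name `nccf_f0_candidate`, its parameters, and the default search range are assumptions made for illustration only.

```python
import numpy as np


def nccf_f0_candidate(frame, fs, f0_min=60.0, f0_max=400.0):
    """Return (f0_hz, merit) for one frame using normalized cross correlation.

    Hypothetical helper for illustration; not taken from the paper.
    `merit` is the peak correlation value, in the range [-1, 1].
    """
    lag_min = int(fs / f0_max)      # shortest period searched, in samples
    lag_max = int(fs / f0_min)      # longest period searched, in samples
    n = len(frame) - lag_max        # same window length for every lag
    if lag_min < 1 or n <= 0:
        raise ValueError("frame too short or search range invalid")

    x = frame[:n]
    energy_x = np.dot(x, x)
    best_lag, best_val = lag_min, -1.0
    for lag in range(lag_min, lag_max + 1):
        y = frame[lag:lag + n]
        denom = np.sqrt(energy_x * np.dot(y, y))
        # Normalizing by both window energies makes the score amplitude invariant.
        val = np.dot(x, y) / denom if denom > 0 else 0.0
        if val > best_val:
            best_lag, best_val = lag, val
    return fs / best_lag, best_val


# Usage: a synthetic 200 Hz tone sampled at 8 kHz (a telephone-band rate).
fs = 8000.0
t = np.arange(400) / fs
tone = np.sin(2 * np.pi * 200.0 * t)
f0, merit = nccf_f0_candidate(tone, fs, f0_min=150.0, f0_max=400.0)
print(f0, merit)
```

The usage example narrows the search range to 150 to 400 Hz so that only one multiple of the true period falls inside it. With a wider range, lags at integer multiples of the period score almost equally well, which is one reason a tracker of the kind the abstract describes keeps several candidates with figures of merit and selects among them across frames rather than trusting a single per-frame peak.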