作者: John H. L. Hansen , Wooil Kim
DOI:
关键词:
摘要: This study proposes an effective angry speech detection approach employing the TEO-based feature extraction. Decorrelation processing is applied to increase model training ability by decreasing correlation between elements and vector size. Minimum classification error employed discrimination other stressed models. Combination with conventional Mel frequency cepstral coefficients (MFCC) also leverage effectiveness of MFCC characterize spectral envelope signals. Experimental results over SUSAS corpus demonstrate proposed scheme at increasing accuracy on open-speaker open-vocabulary task. An improvement up 7.78% in obtained combination methods including decorrelation vector, discriminative training, classifier combination.