Broadcast news audio classification using SVM binary trees

Authors: Jozef Vavrek, Eva Vozarikova, Matus Pleva, Jozef Juhar

DOI: 10.1109/TSP.2012.6256338

Abstract: Audio classification is one of the most important tasks in content-based analysis and can be implemented in many audio applications, such as indexing and retrieving. This paper addresses the problem of broadcast news audio classification, by a support vector machine binary tree (SVM-BT) architecture, into five classes: pure speech, speech with music, speech with environment sound, pure music, and environment sound. One substantial step in creating such a classification architecture is the selection of an optimal feature set for each binary SVM classifier. Therefore, we implement the F-score feature selection algorithm as an effective search algorithm within a space of characteristic features that are mostly used in speech/non-speech discrimination.
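
The F-score criterion mentioned in the abstract can be illustrated with a minimal sketch. It assumes the commonly used two-class definition (squared distance of each class mean from the overall mean, divided by the sum of the per-class sample variances); whether the paper uses exactly this variant is an assumption. The function names `f_score` and `rank_features` and the toy data are hypothetical and are not taken from the paper.

```python
from statistics import mean, variance


def f_score(pos, neg):
    """F-score of a single feature, given its values in the two classes.

    Assumes at least two samples per class (needed for the sample variance).
    """
    all_mean = mean(pos + neg)
    pos_mean, neg_mean = mean(pos), mean(neg)
    # Between-class term: how far each class mean lies from the overall mean.
    between = (pos_mean - all_mean) ** 2 + (neg_mean - all_mean) ** 2
    # Within-class term: sample variances (n - 1 in the denominator).
    within = variance(pos) + variance(neg)
    # A feature that is constant inside both classes gets an infinite score here.
    return between / within if within > 0 else float("inf")


def rank_features(x_pos, x_neg):
    """Return (feature indices by decreasing F-score, list of F-scores)."""
    n_features = len(x_pos[0])
    scores = [
        f_score([row[j] for row in x_pos], [row[j] for row in x_neg])
        for j in range(n_features)
    ]
    order = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return order, scores


# Toy data (illustrative only): feature 0 separates the classes, feature 1 does not.
speech = [[1.0, 5.0], [1.2, 3.0], [0.8, 4.0]]
non_speech = [[3.0, 4.0], [3.2, 5.0], [2.8, 3.0]]
order, scores = rank_features(speech, non_speech)
```

In a binary-tree setup, a ranking like this would be computed separately at each node, since each binary SVM separates a different pair of class groups and may therefore favor a different feature subset.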
