MUSIC STRUCTURAL SEGMENTATION BY COMBINING HARMONIC AND TIMBRAL INFORMATION

作者: Ming Li , Ruofeng Chen

DOI:

关键词: InitializationArtificial intelligenceComputer scienceSegmentationHarmonic (mathematics)Probabilistic logicPattern recognitionSpeech recognitionFeature (machine learning)Mel-frequency cepstrumCluster analysisNon-negative matrix factorization

摘要: We propose a novel model for music structural segmentation aiming at combining harmonic and timbral information. use two-level clustering with splitting initialization random turbulence to produce segment labels using chroma MFCC separately as feature. construct score matrix combine from both aspects. Finally Nonnegative Matrix Factorization Maximum Likelihood are applied extract the final labels. By comparing sparseness, our method is capable of automatically determining number types in given song. The pairwise F-measure algorithm can reach 0.63 without rules knowledge, running on 180 Beatles songs. show be easily associated more sophisticated algorithms extended probabilistic models.

参考文章(16)
Beth Logan, Mel frequency cepstral coefficients for music modeling international symposium/conference on music information retrieval. ,(2000)
Matthias Mauch, Katy C. Noland, Simon Dixon, USING MUSICAL STRUCTURE TO ENHANCE AUTOMATIC CHORD TRANSCRIPTION international symposium/conference on music information retrieval. pp. 231- 236 ,(2009)
Anssi Klapuri, Jouni Paulus, Meinard Müller, Audio-based Music Structure Analysis 11th International Society for Music Information Retrieval Conference. pp. 625- 636 ,(2010)
Thomas Hofmann, Probabilistic latent semantic analysis uncertainty in artificial intelligence. ,vol. 15, pp. 289- 296 ,(1999)
Christopher M. Bishop, Pattern Recognition and Machine Learning ,(2006)
Jonathan Foote, Visualizing music and audio using self-similarity acm multimedia. pp. 77- 80 ,(1999) , 10.1145/319463.319472
Mark Levy, Mark Sandler, Structural Segmentation of Musical Audio by Constrained Clustering IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 16, pp. 318- 326 ,(2008) , 10.1109/TASL.2007.910781
Patrik O. Hoyer, Non-negative Matrix Factorization with Sparseness Constraints Journal of Machine Learning Research. ,vol. 5, pp. 1457- 1469 ,(2004) , 10.5555/1005332.1044709
Y. Linde, A. Buzo, R. Gray, An Algorithm for Vector Quantizer Design IEEE Transactions on Communications. ,vol. 28, pp. 84- 95 ,(1980) , 10.1109/TCOM.1980.1094577
H. Sebastian Seung, Daniel D. Lee, Algorithms for Non-negative Matrix Factorization neural information processing systems. ,vol. 13, pp. 556- 562 ,(2000)