Music structure analysis by subspace modeling

作者: Constantine Kotropoulos , Yannis Panagakis

DOI:

关键词:

摘要: Automatic music structure analysis is casted as a subspace clustering problem. By assuming that the feature vectors extracted from specific segment are drawn single subspace, any sequence of such derived recording will lie in union many subspaces segments are. First, sparse and low-rank tested for by employing three types beat-synchronous audio sequences. Next, novel computational efficient method proposed, coined ridge representation (RRSC). The performance aforementioned methods assessed conducting experiments on manually annotated Beatles benchmark dataset. experimental results indicate that: 1) RRSC comparable or exceeds 2) outperforms state-of-the-art proposed analysis.

参考文章(17)
Anssi Klapuri, Jouni Paulus, Meinard Müller, Audio-based Music Structure Analysis 11th International Society for Music Information Retrieval Conference. pp. 625- 636 ,(2010)
Ming Li, Ruofeng Chen, MUSIC STRUCTURAL SEGMENTATION BY COMBINING HARMONIC AND TIMBRAL INFORMATION international symposium/conference on music information retrieval. pp. 477- 482 ,(2011)
R. Lyon, A computational model of filtering, detection, and compression in the cochlea international conference on acoustics, speech, and signal processing. ,vol. 7, pp. 1282- 1285 ,(1982) , 10.1109/ICASSP.1982.1171644
MJ Michael Bruderer, MF McKinney, AG Armin Kohlrausch, Structural boundary perception in popular music. international symposium/conference on music information retrieval. pp. 198- 201 ,(2006)
Guangcan Liu, Zhouchen Lin, Shuicheng Yan, Ju Sun, Yong Yu, Yi Ma, Robust Recovery of Subspace Structures by Low-Rank Representation IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 35, pp. 171- 184 ,(2013) , 10.1109/TPAMI.2012.88
Ehsan Elhamifar, Rene Vidal, Sparse subspace clustering computer vision and pattern recognition. pp. 2790- 2797 ,(2009) , 10.1109/CVPR.2009.5206547
Rajendra Bhatia, Fuad Kittaneh, Norm inequalities for partitioned operators and an application Mathematische Annalen. ,vol. 287, pp. 719- 726 ,(1990) , 10.1007/BF01446925
Mark Levy, Mark Sandler, Structural Segmentation of Musical Audio by Constrained Clustering IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 16, pp. 318- 326 ,(2008) , 10.1109/TASL.2007.910781
Daniel P. W. Ellis, Beat Tracking by Dynamic Programming Journal of New Music Research. ,vol. 36, pp. 51- 60 ,(2007) , 10.1080/09298210701653344
Jouni Paulus, Anssi Klapuri, Music Structure Analysis Using a Probabilistic Fitness Measure and a Greedy Search Algorithm IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 17, pp. 1159- 1170 ,(2009) , 10.1109/TASL.2009.2020533