A method for identifying repetition structure in musical audio based on time series prediction

作者: Anssi Klapuri , Peter Foster , Simon Dixon

DOI:

关键词:

摘要: This paper investigates techniques for determining the repetition structure of musical audio. In particular, we consider problem segment similarity from perspective time series prediction, where seek to quantify in terms pairwise predictability between segments. To this end, propose a novel approach based on multivariate modelling audio features. Using chroma and MFCC features assumption that correct boundaries have been previously obtained, evaluate proposed against Beatles dataset. We both Queen Mary Tampere University versions dataset annotations. obtain maximum F-score 84%. Compared randomised baseline approach, result corresponds performance improvement 26 percentage points.

参考文章(17)
Sebastian Ewert, Meinard Müller, Chroma Toolbox: MATLAB Implementations for Extracting Variants of Chroma-based Audio Features international symposium/conference on music information retrieval. pp. 215- 220 ,(2011)
Bee Suan Ong, Structural analysis and segmentation of music signals Department of Information and Communication Technologies. ,(2007)
Arnold Neumaier, Tapio Schneider, Estimation of parameters and eigenmodes of multivariate autoregressive models ACM Transactions on Mathematical Software. ,vol. 27, pp. 27- 57 ,(2001) , 10.1145/382043.382304
Jonathan Foote, Visualizing music and audio using self-similarity acm multimedia. pp. 77- 80 ,(1999) , 10.1145/319463.319472
Mark Levy, Mark Sandler, Structural Segmentation of Musical Audio by Constrained Clustering IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 16, pp. 318- 326 ,(2008) , 10.1109/TASL.2007.910781
J. Serra, H. Kantz, X. Serra, R. G. Andrzejak, Predictability of Music Descriptor Time Series and its Application to Cover Song Detection IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 20, pp. 514- 525 ,(2012) , 10.1109/TASL.2011.2162321
Jouni Paulus, Anssi Klapuri, Music Structure Analysis Using a Probabilistic Fitness Measure and a Greedy Search Algorithm IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 17, pp. 1159- 1170 ,(2009) , 10.1109/TASL.2009.2020533
Domenico Piccolo, A DISTANCE MEASURE FOR CLASSIFYING ARIMA MODELS Journal of Time Series Analysis. ,vol. 11, pp. 153- 164 ,(1990) , 10.1111/J.1467-9892.1990.TB00048.X
Jianbo Shi, J. Malik, Normalized cuts and image segmentation IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 22, pp. 888- 905 ,(2000) , 10.1109/34.868688