A method for identifying repetition structure in musical audio based on time series prediction

作者： Anssi Klapuri , Peter Foster , Simon Dixon

DOI:

关键词:

摘要: This paper investigates techniques for determining the repetition structure of musical audio. In particular, we consider problem segment similarity from perspective time series prediction, where seek to quantify in terms pairwise predictability between segments. To this end, propose a novel approach based on multivariate modelling audio features. Using chroma and MFCC features assumption that correct boundaries have been previously obtained, evaluate proposed against Beatles dataset. We both Queen Mary Tampere University versions dataset annotations. obtain maximum F-score 84%. Compared randomised baseline approach, result corresponds performance improvement 26 percentage points.

uni-trier.de 本地加速

ieee.org 本地加速

qmul.ac.uk PDF 下载加速

参考文章(17)

Sebastian Ewert, Meinard Müller, Chroma Toolbox: MATLAB Implementations for Extracting Variants of Chroma-based Audio Features international symposium/conference on music information retrieval. pp. 215- 220 ,(2011)

Bee Suan Ong, Structural analysis and segmentation of music signals Department of Information and Communication Technologies. ,(2007)

David Huron, Sweet Anticipation: Music and the Psychology of Expectation ,(2006)

Arnold Neumaier, Tapio Schneider, Estimation of parameters and eigenmodes of multivariate autoregressive models ACM Transactions on Mathematical Software. ,vol. 27, pp. 27- 57 ,(2001) , 10.1145/382043.382304

Jonathan Foote, Visualizing music and audio using self-similarity acm multimedia. pp. 77- 80 ,(1999) , 10.1145/319463.319472

Mark Levy, Mark Sandler, Structural Segmentation of Musical Audio by Constrained Clustering IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 16, pp. 318- 326 ,(2008) , 10.1109/TASL.2007.910781

J. Serra, H. Kantz, X. Serra, R. G. Andrzejak, Predictability of Music Descriptor Time Series and its Application to Cover Song Detection IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 20, pp. 514- 525 ,(2012) , 10.1109/TASL.2011.2162321

Jouni Paulus, Anssi Klapuri, Music Structure Analysis Using a Probabilistic Fitness Measure and a Greedy Search Algorithm IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 17, pp. 1159- 1170 ,(2009) , 10.1109/TASL.2009.2020533

Domenico Piccolo, A DISTANCE MEASURE FOR CLASSIFYING ARIMA MODELS Journal of Time Series Analysis. ,vol. 11, pp. 153- 164 ,(1990) , 10.1111/J.1467-9892.1990.TB00048.X

10.

Jianbo Shi, J. Malik, Normalized cuts and image segmentation IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 22, pp. 888- 905 ,(2000) , 10.1109/34.868688

A method for identifying repetition structure in musical audio based on time series prediction

来源期刊

我的账户

A method for identifying repetition structure in musical audio based on time series prediction

来源期刊

相似文章 2

Data reduction of audio by exploiting musical repetition

Evolution of the Informational Complexity of Contemporary Western Music

我的账户