A feedback framework for improved chord recognition based on NMF-based approximate note transcription

作者: Satoshi Maruo , Kazuyoshi Yoshii , Katsutoshi Itoyama , Matthias Mauch , Masataka Goto

DOI: 10.1109/ICASSP.2015.7177959

关键词: Hidden Markov modelChord (music)Speech recognitionArtificial intelligenceTranscription (music)Feature extractionAudio signalBayesian probabilityNon-negative matrix factorizationMatrix decompositionPattern recognitionMathematics

摘要: This paper presents a feedback framework that can improve chord recognition for music audio signals by performing approximate note transcription with Bayesian non-negative matrix factorization (NMF) using prior knowledge on chords. Although the names and compositions of chords are intrinsically linked each other (e.g., C major highly likely to include C, E, G notes, those notes be in chords), (multipitch analysis) have been studied independently. To solve this chicken-and-egg problem, our iterates other's results. More specifically, we first perform based NMF forces basis spectra respectively correspond different semitone-level pitches covering whole range. We then execute hidden Markov models (HMMs) use chroma features obtained from activation patterns pitches. transcription, again encourages certain kinds region activated. Experimental results showed gradually improved accuracy recognition.

参考文章(33)
Katsutoshi Itoyama, Tetsuya Ogata, Hiroshi G. Okuno, Automatic chord recognition based on probabilistic integration of acoustic features, bass sounds, and chord transition international conference industrial engineering other applications applied intelligent systems. pp. 58- 67 ,(2012) , 10.1007/978-3-642-31087-4_7
Mitsunori Ogihara, Tao Li, N-GRAM CHORD PROFILES FOR COMPOSER STYLE REPRESENTATION international symposium/conference on music information retrieval. pp. 671- 676 ,(2008)
Laurent Oudre, Cédric Févotte, Yves Grenier, TEMPLATE-BASED CHORD RECOGNITION : INFLUENCE OF THE CHORD TYPES international symposium/conference on music information retrieval. pp. 153- 158 ,(2009)
Christopher Harte, Towards automatic extraction of harmony information from music signals Queen Mary, University of London. ,(2010)
Katsutoshi Itoyama, Tetsuya Ogata, Kazunori Komatani, Kazuyoshi Yoshii, Hiroshi G. Okuno, Kouhei Sumi, Automatic chord recognition based on probabilistic integration of chord transition and bass pitch estimation international symposium/conference on music information retrieval. pp. 39- 44 ,(2008)
Matthias Mauch, Hiromasa Fujihara, Masataka Goto, Tomoyasu Nakano, Kazuyoshi Yoshii, SONGLE: A WEB SERVICE FOR ACTIVE MUSIC LISTENING IMPROVED BY USER CONTRIBUTIONS international symposium/conference on music information retrieval. pp. 311- 316 ,(2011)
Alexander Sheh, Daniel P. W. Ellis, Chord Segmentation and Recognition using EM-Trained Hidden Markov Models international symposium/conference on music information retrieval. pp. 185- 191 ,(2003) , 10.7916/D8TB1H83
Mark Sandler, Christopher Harte, Automatic Chord Identifcation using a Quantised Chromagram Journal of The Audio Engineering Society. ,(2005)
Christopher M. Bishop, Pattern Recognition and Machine Learning ,(2006)