Rhythm Transcription of Polyphonic Piano Music Based on Merged-Output HMM for Multiple Voices

作者: Eita Nakamura , Kazuyoshi Yoshii , Shigeki Sagayama

DOI: 10.1109/TASLP.2017.2662479

关键词: Hidden Markov modelSpeech recognitionRhythmPianoInferenceSource codeComputer sciencePolyrhythmTranscription (music)MIDI

摘要: In a recent conference paper, we have reported rhythm transcription method based on merged-output hidden Markov model HMM that explicitly describes the multiple-voice structure of polyphonic music. This solves major problem conventional methods could not properly describe nature multiple voices as in polyrhythmic scores or phenomenon loose synchrony between voices. this present complete description proposed and develop an inference technique, which is valid for any HMMs, output probabilities depend past events. We also examine influence architecture parameters terms accuracies voice separation perform comparative evaluations with six other algorithms. Using MIDI recordings classical piano pieces, found outperformed by more than 12 points accuracy performances performed almost good best one non-polyrhythmic performances. reveals state-of-the-art first time literature. Publicly available source codes are provided future comparisons.

参考文章(30)
Haruhiro Katayose, Mitsuyo Hashida, Toshie Matsui, A NEW MUSIC DATABASE DESCRIBING DEVIATION INFORMATION OF PERFORMANCE EXPRESSIONS international symposium/conference on music information retrieval. pp. 489- 494 ,(2008)
Carl Schachter, Felix Salzer, Counterpoint in Composition: The Study of Voice Leading ,(1969)
Zoubin Ghahramani, Michael Jordan, None, Factorial Hidden Markov Models neural information processing systems. ,vol. 29, pp. 472- 478 ,(1995) , 10.1023/A:1007425814087
Makoto Tanji, Daichi Ando, Hitoshi Iba, Improving Metrical Grammar with Grammar Expansion AI 2008: Advances in Artificial Intelligence. pp. 180- 191 ,(2008) , 10.1007/978-3-540-89378-3_18
Jeff Pressing, Peter Lawrence, Transcribe: A Comprehensive Autotranscription Program international computer music conference. ,vol. 1993, ,(1993)
Christopher Raphael, Automated Rhythm Transcription. international symposium/conference on music information retrieval. ,(2001)
Darrell Conklin, Representation and Discovery of Vertical Patterns in Music international conference on music and artificial intelligence. pp. 32- 42 ,(2002) , 10.1007/3-540-45722-4_5
David Temperley, Daniel Sleator, Modeling Meter and Harmony: A Preference-Rule Approach Computer Music Journal. ,vol. 23, pp. 10- 27 ,(1999) , 10.1162/014892699559616
Bruno Gingras, Stephen McAdams, Improved Score-performance Matching Using Both Structural and Temporal Information from MIDI Recordings Journal of New Music Research. ,vol. 40, pp. 43- 57 ,(2011) , 10.1080/09298215.2010.545422