Transcribing Bach Chorales Limitations and Potentials of Non-Negative Matrix Factorisation

Somnuk Phon-Amnuaisuk

doi:10.1186/preaccept-9933733055492847

Abstract

This article discusses our research on polyphonic music transcription using non-negative matrix factorisation (NMF). The application of NMF in polyphonic transcription offers an alternative approach in which observed frequency spectra from polyphonic audio could be seen as an aggregation of spectra from monophonic components. However, it is not easy to find accurate aggregations using a standard NMF procedure since there are many ways to satisfy the factoring of V ≈ WH. Three limitations associated with the application of standard NMF to factor frequency spectra are (i) the permutation of transcription output; (ii) the unknown factoring r; and (iii) the factoring W and H that have a tendency to be trapped in a sub-optimal solution. This work explores the uses of the heuristics that exploit the harmonic information of each pitch to tackle these limitations. In our implementation, this harmonic information is learned from the training data consisting of the pitches from a desired instrument, while the unknown effective r is approximated from the correlation between the input signal and the training data. This approach offers an effective exploitation of the domain knowledge. The empirical results show that the proposed approach could significantly improve the accuracy of the transcription output as compared to the standard NMF approach.

Highlights

Automatic music transcription concerns the translation of music sounds into written manuscripts in standard music notations
The matrix W of basis vectors is learned from each pitch from a desired instrument. This ensures that the basis vector (a.k.a. dictionary, Tone-model) represents the harmonic structure of each pitch at the expense of the basis vector matrix being applicable for that particular instrument only
3 Exploring negative matrix factorisation (NMF) for polyphonic transcription We investigate the application of NMF to extract polyphonic notes from a given polyphonic audio

Summary

Introduction

Automatic music transcription concerns the translation of music sounds into written manuscripts in standard music notations. Each neural network was trained to recognise one piano note with the frequency spectral features from approximately 30,000 samples where one-third of them were positive examples Soft computing approaches such as connectionism, support vector machine, hidden Markov model [23,24,26], etc., usually require complete training data as the performance of the model highly depends on the decision boundary constructed using the information from the training examples. This ensures that the basis vector (a.k.a. dictionary, Tone-model) represents the harmonic structure of each pitch at the expense of the basis vector matrix being applicable for that particular instrument only (e.g., the Tone-model learned from a piano will not work well with, for example, a violin) Many applications such as a performance analysis module in a guitar tutoring system, could benefit from this. Each Xk coefficient is a complex number; its corresponding magnitude and phase represent the corresponding magnitude and phase of frequency at k fs N

Piano roll representation

Switching off inactive pitches

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURASIP Journal on Audio, Speech, and Music Processing	Publication Date: Jan 1, 2012
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Transcribing Bach Chorales Limitations and Potentials of Non-Negative Matrix Factorisation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Audio, Speech, and Music Processing

Lead the way for us

Similar Papers

Transcribing Bach chorales: Limitations and potentials of non-negative matrix factorisation
Somnuk Phon-Amnuaisuk
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2012
Somnuk Phon-AmnuaisukSomnuk Phon-Amnuaisuk
27 Feb 2012
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2012

Advances in Nonnegative Matrix and Tensor Factorization
A Cichocki ... P Smaragdis
Computational Intelligence and Neuroscience | VOL. 2008
A Cichocki, et. al.A Cichocki ... P Smaragdis
01 Jan 2008
Computational Intelligence and Neuroscience | VOL. 2008

Nonnegative Discriminant Matrix Factorization
Yuwu Lu ... David Zhang
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 27
Yuwu Lu, et. al.Yuwu Lu ... David Zhang
01 Jul 2017
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 27

Non-negative matrix factorization for speech/music separation using source dependent decomposition rank, temporal continuity term and filtering
S Abdali ... B Nasersharif
Biomedical Signal Processing and Control | VOL. 36
S Abdali, et. al.S Abdali ... B Nasersharif
15 Apr 2017
Biomedical Signal Processing and Control | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Transcribing Bach Chorales Limitations and Potentials of Non-Negative Matrix Factorisation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Audio, Speech, and Music Processing