Abstract

The use of the non‐negative matrix factorization (NMF) as a decomposition technique has dramatically grown in various signal processing applications over the last years. Its computation, based on the iterative minimization of a cost function, relies on several choices, among which the distance involved in the cost function itself but also the initialization of the algorithm. These choices are all the more crucial as the usual algorithms, iterative, only ensure convergence to a local minimum. In this work, we compare three typical distances in the NMF problem (namely Euclidian, Kullback‐Leibler divergence and Itakura‐Saito distance) and their combination with different initializations, in an audio context: decomposition of the time‐frequency representation for the transcription of polyphonic music. Both the existence of global and local minima, and the efficiency of transcription are examined. Moreover, NMF update rules can be formulated in a unified framework for the three aforementioned cost functions. This formulation allows figuring out new NMF algorithms which could address the local minima question.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.