Discovering speech phones using convolutive non-negative matrix factorisation with a sparseness constraint

Paul D O’Grady,Barak A Pearlmutter

doi:10.1016/j.neucom.2008.01.033

Discovering speech phones using convolutive non-negative matrix factorisation with a sparseness constraint

Paul D O’Grady, Barak A Pearlmutter

Open Access

https://doi.org/10.1016/j.neucom.2008.01.033

Copy DOI

Journal: Neurocomputing	Publication Date: Sep 13, 2008
Citations: 81

Affiliation: University College Dublin, National University of Ireland

#Speech Phones #Signal Processing Tasks + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Discovering a representation that allows auditory data to be parsimoniously represented is useful for many machine learning and signal processing tasks. Such a representation can be constructed by non-negative matrix factorisation (NMF), a method for finding parts-based representations of non-negative data. Here, we present an extension to convolutive NMF that includes a sparseness constraint, where the resultant algorithm has multiplicative updates and utilises the beta divergence as its reconstruction objective. In combination with a spectral magnitude transform of speech, this method discovers auditory objects that resemble speech phones along with their associated sparse activation patterns. We use these in a supervised separation scheme for monophonic mixtures, finding improved separation performance in comparison to standard convolutive NMF.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Neurocomputing

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.