Overlapping sound event recognition using local spectrogram features and the generalised hough transform

J Dennis,H.D Tran,E.S Chng

doi:10.1016/j.patrec.2013.02.015

Abstract

In this paper, we address the challenging task of simultaneous recognition of overlapping sound events from single channel audio. Conventional frame-based methods are not well suited to the problem, as each time frame contains a mixture of information from multiple sources. Missing feature masks are able to improve the recognition in such cases, but are limited by the accuracy of the mask, which is a non-trivial problem. In this paper, we propose an approach based on Local Spectrogram Features (LSFs) which represent local spectral information that is extracted from the two-dimensional region surrounding “keypoints” detected in the spectrogram. The keypoints are designed to locate the sparse, discriminative peaks in the spectrogram, such that we can model sound events through a set of representative LSF clusters and their occurrences in the spectrogram. To recognise overlapping sound events, we use a Generalised Hough Transform (GHT) voting system, which sums the information over many independent keypoints to produce onset hypotheses, that can detect any arbitrary combination of sound events in the spectrogram. Each hypothesis is then scored against the class distribution models to recognise the existence of the sound in the spectrogram. Experiments on a set of five overlapping sound events, in the presence of non-stationary background noise, demonstrate the potential of our approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Overlapping sound event recognition using local spectrogram features and the generalised hough transform

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Journal: Pattern Recognition Letters	Publication Date: Mar 13, 2013
Citations: 84

Similar Papers

Enhanced local feature approach for overlapping sound event recognition
Jonathan Dennis ... Huy Dat Tran
-
Jonathan Dennis, et. al.Jonathan Dennis ... Huy Dat Tran
01 Dec 2014
01 Dec 2014

Sound event recognition in unstructured environments using spectrogram image processing
Jonathan William Dennis
-
Jonathan William DennisJonathan William Dennis
01 Jan 2014
01 Jan 2014

Decision letter: Causal neural mechanisms of context-based object recognition
Redmond G O'Connell ... Joshua I Gold
-
Redmond G O'Connell, et. al.Redmond G O'Connell ... Joshua I Gold
03 Jun 2021
03 Jun 2021

Fast Renal Cortex Localization by Combining Generalized Hough Transform and Active Appearance Models
Dehui Xiang ... Chao Jin
-
Dehui Xiang, et. al.Dehui Xiang ... Chao Jin
01 Jan 2013
01 Jan 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Overlapping sound event recognition using local spectrogram features and the generalised hough transform

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters