Abstract
This paper addresses three difficulties in audio model training: slow convergence, large data requirements, and the high dimensionality of the generated audio feature vectors. Because audio data and image data do not align well after being transferred into the spectrogram domain, we propose quaternion Gabor filtering to suppress background information in the spectrogram and reduce interference in the data. In addition, window lengths and frame shifts at different scales are used to capture the relationships between different sounding objects. To address the high dimensionality of the generated feature vectors, we use a deep hashing module to map high-dimensional features to low-dimensional ones, and a probability function to make the learned samples better match the overall distribution. The proposed method was evaluated on an environmental sound classification dataset and a music genre classification dataset; using only a common backbone network, it improves audio recognition accuracy.
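The multi-scale idea mentioned above (computing spectrograms with several window lengths and frame shifts so that both short transients and longer-range structure are captured) can be sketched as follows. This is a minimal illustration, not the paper's implementation; the specific window/hop values and the test signal are assumptions.

```python
import numpy as np

def stft_spectrogram(signal, win_len, hop):
    """Magnitude spectrogram via a Hann-windowed short-time Fourier transform."""
    window = np.hanning(win_len)
    n_frames = 1 + (len(signal) - win_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + win_len] * window for i in range(n_frames)]
    )
    # rfft keeps only the non-negative frequencies: win_len // 2 + 1 bins.
    return np.abs(np.fft.rfft(frames, axis=1))

# A 440 Hz sine at 16 kHz as a stand-in for an audio clip (hypothetical input).
signal = np.sin(2 * np.pi * 440 * np.arange(16000) / 16000)

# Short windows resolve transients; long windows resolve pitch. The
# (window length, frame shift) pairs here are illustrative only.
scales = [(256, 128), (1024, 512)]
specs = [stft_spectrogram(signal, w, h) for w, h in scales]
```

Each element of `specs` is a time-frequency representation at one scale; a model can then consume the set jointly to relate sounding objects that manifest at different temporal resolutions.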