Audio Signal Classification Research Articles

The increasing demand for cost-efficient biodiversity data at large spatiotemporal scales has led to an increase in the collection of large ecoacoustic datasets. Whilst the ease of collection and storage of audio data has rapidly increased and costs have fallen, methods for robust analysis of the data have not developed so quickly. Identification and classification of audio signals to species level is extremely desirable, but reliability can be highly affected by non-target noise, especially rainfall. Despite this demand, there are few easily applicable pre-processing methods for rainfall detection available to conservation practitioners and ecologists. Here, we use threshold values of two simple measures, Power Spectrum Density (amplitude) and Signal-to-Noise Ratio at two frequency bands, to differentiate between the presence and absence of heavy rainfall. We assess the effect of using different threshold values on Accuracy and Specificity. We apply the method to four datasets from both tropical and temperate regions, and find that it has up to 99% accuracy on tropical datasets (e.g. from the Brazilian Amazon), but performs less well in temperate environments. This is likely due to the intensity of rainfall in tropical forests and to the rain falling on dense, broadleaf vegetation, which amplifies the sound. We show that by choosing between different threshold values, informed trade-offs can be made between Accuracy and Specificity, thus allowing the exclusion of large amounts of audio data containing rainfall in all locations without the loss of data not containing rain. We assess the impact of using different sample sizes of audio data to set threshold values, and find that 200 audio files of 15 s each represent an optimal trade-off between effort, accuracy and specificity in most scenarios. This methodology and the accompanying R package ‘hardRain’ are the first automated rainfall detection tool for pre-processing large acoustic datasets without the need for any additional rain gauge data.
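The threshold idea in this abstract can be illustrated with a minimal Python sketch. It is not the hardRain package or its API. The band limits in `BANDS`, the definition of signal-to-noise ratio as mean band power divided by its standard deviation, and the function names `band_metrics`, `fit_thresholds` and `is_rain` are all assumptions made for illustration. Thresholds are taken from clips known to contain rain, and a new clip is flagged as rain only if every metric reaches its threshold.

```python
import numpy as np

# Hypothetical band limits in Hz; the abstract only says "two frequency bands".
BANDS = ((600.0, 1200.0), (4400.0, 5600.0))


def band_metrics(clip, sample_rate):
    """Return [psd_band1, s2n_band1, psd_band2, s2n_band2] for one audio clip."""
    clip = np.asarray(clip, dtype=float)
    # Periodogram estimate of power at each frequency bin.
    power = np.abs(np.fft.rfft(clip)) ** 2 / len(clip)
    freqs = np.fft.rfftfreq(len(clip), d=1.0 / sample_rate)
    metrics = []
    for low, high in BANDS:
        in_band = power[(freqs >= low) & (freqs <= high)]
        mean_power = in_band.mean()
        # Assumed signal-to-noise definition: mean band power over its spread.
        metrics += [mean_power, mean_power / in_band.std()]
    return np.array(metrics)


def fit_thresholds(rain_clips, sample_rate, quantile=0.0):
    """Derive one threshold per metric from clips known to contain rain.

    quantile=0.0 uses the minimum (flags more clips as rain); a higher
    quantile is stricter and flags fewer clips.
    """
    table = np.array([band_metrics(c, sample_rate) for c in rain_clips])
    return np.quantile(table, quantile, axis=0)


def is_rain(clip, sample_rate, thresholds):
    """Flag a clip as rain only if every metric reaches its threshold."""
    return bool(np.all(band_metrics(clip, sample_rate) >= thresholds))
```

Raising `quantile` in this sketch mirrors the trade-off the abstract describes: stricter thresholds flag fewer clips as rain, so fewer rain-free clips are discarded, at the cost of letting more rainy clips through.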

Abstract: Musical genre classification is put into context by explaining the structures in music and how music is analyzed and perceived by humans. The growth of music databases in personal collections and on the Internet has created great demand for music information retrieval, and especially for automatic musical genre classification. In this research we focus on combining information from the audio signal rather than from different sources. This paper presents a comprehensive machine learning approach to the problem of automatic musical genre classification using the audio signal. The proposed approach uses two feature vectors, a support vector machine (SVM) classifier with a polynomial kernel function, and machine learning algorithms. More specifically, two feature sets representing frequency-domain, temporal-domain, cepstral-domain and modulation-frequency-domain audio features are proposed. With the proposed features, the SVM acts as a strong base learner in AdaBoost, so the performance of the SVM classifier cannot be improved by boosting. The final genre classification is obtained from the set of individual results using a weighted-combination late fusion method, which outperformed the trained fusion method. Music genre classification accuracies of 78% and 81% are reported on the GTZAN dataset (ten musical genres) and the ISMIR2004 genre dataset (six musical genres), respectively. We observed higher classification accuracies with the ensembles than with the individual classifiers; the improvement on the GTZAN and ISMIR2004 genre datasets is three percent on average. This ensemble approach shows that it is possible to improve classification accuracy by using different types of domain-based audio features.
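Weighted late fusion, as mentioned in this abstract, can be sketched in a few lines of Python. The abstract does not give the weighting scheme, so the function name `late_fusion`, the input layout (one probability vector per classifier) and the weights in the usage note are illustrative assumptions, not the authors' method.

```python
def late_fusion(probabilities, weights):
    """Combine per-classifier class probabilities with a normalized weighted sum.

    probabilities: one list of class probabilities per classifier.
    weights: one non-negative weight per classifier (assumed here to be
             chosen by the user, e.g. from validation accuracy).
    Returns (index of the winning class, fused score per class).
    """
    if len(probabilities) != len(weights):
        raise ValueError("need exactly one weight per classifier")
    total = float(sum(weights))
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    n_classes = len(probabilities[0])
    fused = [
        sum(w * p[k] for w, p in zip(weights, probabilities)) / total
        for k in range(n_classes)
    ]
    # The predicted class is the one with the highest fused score.
    best = max(range(n_classes), key=fused.__getitem__)
    return best, fused
```

For example, with two classifiers giving `[0.6, 0.3, 0.1]` and `[0.2, 0.5, 0.3]`, weighting the second classifier three times as heavily makes class 1 the winner, while reversing the weights makes class 0 the winner.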

Articles published on Audio Signal Classification

HardRain: An R package for quick, automated rainfall detection in ecoacoustic datasets using a threshold-based approach

Feature selection based on MBFOA for audio signal classification under consideration of Gaussian white noise

Dictionary learning based on M‐PCA‐N for audio signal sparse representation

Modified DCTNet for audio signals classification

Soundscape Audio Signal Classification and Segmentation Using Listeners Perception of Background and Foreground Sound

CALCULATION OF PITCH FOR THE IRANIAN TRADITIONAL MUSIC GENRE CLASSIFICATION STUDIED ON TWO STRINGED INSTRUMENTS THE TAR AND THE SETAR

An Efficient Approach for Segmentation, Feature Extraction and Classification of Audio Signals

PyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis.

Study of Algorithms for Separation of Singing Voice from Music

Comparative Study of Filter Performance for Separation of Singing Voice from Music Accompaniment

Soft Margin Based Low-Rank Audio Signal Classification

Classification of reverberant audio signals using clustered ad hoc distributed microphones

Classification of audio events using permutation transformation

Automatic Music Genre Classification of Audio Signals with Machine Learning Approaches

An analysis of content-based classification of audio signals using a fuzzy c-means algorithm

Time–Frequency Matrix Feature Extraction and Classification of Environmental Audio Signals

Generalizability and Simplicity as Criteria in Feature Selection: Application to Mood Classification in Music

Spectral and Temporal Periodicity Representations of Rhythm for the Automatic Classification of Music Audio Signal

Classification of audio signals using AANN and GMM
