Set Of Audio Features Research Articles

This study presents a novel approach to emergency vehicle classification that uses a comprehensive set of audio features to distinguish between ambulance sirens, fire truck sirens, and traffic noise. A unique contribution lies in combining time-domain features, namely root mean square (RMS) and zero-crossing rate, which capture temporal characteristics such as changes in signal energy, with frequency-domain features derived from the short-time Fourier transform (STFT). The latter include spectral centroid, spectral bandwidth, and spectral roll-off, which describe the sound’s frequency content and help differentiate siren patterns from traffic noise. Additionally, Mel-frequency cepstral coefficients (MFCCs) are incorporated to represent the spectral information in a way that approximates human auditory perception. Because this combination captures both temporal and spectral characteristics of the audio signals, it improves the model’s ability to discriminate between emergency vehicles and traffic noise compared to using features from a single domain. A further significant contribution is the integration of data augmentation techniques that replicate real-world conditions, including the Doppler effect and noisy environments. The study also applies several machine learning algorithms to the extracted features and compares them to determine the most effective classifier for this task. The support vector machine (SVM) achieves the highest accuracy at 99.5%, followed by random forest (RF) and k-nearest neighbors (KNN) at 98.5% each, while AdaBoost reaches 96.0% and long short-term memory (LSTM) reaches 93%. A stacked ensemble classifier built on these base learners also achieves an accuracy of 99.5%.
To validate the results, the study further conducted leave-one-out cross-validation (LOOCV), in which SVM and RF each achieved an accuracy of 98.5%, followed by KNN at 97.0% and AdaBoost at 90.5%. These findings indicate the strong performance of advanced ML techniques in emergency vehicle classification.
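The time-domain and frequency-domain features named in this abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it computes RMS, zero-crossing rate, and spectral centroid for a single frame using a naive DFT from the Python standard library, whereas the study derives its spectral features from an STFT. The function names, the 8 kHz sample rate, and the 1 kHz test tone are assumptions chosen only for illustration.

```python
import cmath
import math


def rms(frame):
    """Root mean square: a measure of the frame's signal energy."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (O(N^2), fine for a short demo frame)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def spectral_centroid(frame, sample_rate):
    """Magnitude-weighted mean frequency of the frame, in Hz."""
    mags = magnitude_spectrum(frame)
    n = len(frame)
    freqs = [k * sample_rate / n for k in range(len(mags))]
    total = sum(mags)
    if total == 0:
        return 0.0
    return sum(f * m for f, m in zip(freqs, mags)) / total


# Hypothetical test signal (assumption): a 1 kHz sine sampled at 8 kHz.
SAMPLE_RATE = 8000
frame = [math.sin(2 * math.pi * 1000 * t / SAMPLE_RATE) for t in range(256)]

features = {
    "rms": rms(frame),
    "zcr": zero_crossing_rate(frame),
    "centroid_hz": spectral_centroid(frame, SAMPLE_RATE),
}
```

For a pure tone, the centroid sits at the tone's frequency and the zero-crossing rate is close to twice the frequency divided by the sample rate. A practical pipeline would compute these per frame over the whole recording and would normally use an FFT rather than the quadratic-time DFT shown here.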

Background and objective: A fetal phonocardiography signal can be hard to interpret and classify due to various sources of additive noise in the womb, ranging from fetal movement to maternal heart sounds. Nevertheless, the non-invasive nature of the method makes it potentially suitable for long-term monitoring of fetal health, especially since it can be implemented on ubiquitous devices such as smartphones. We employed empirical mode decomposition to extract intrinsic mode functions, which make additional characteristics of the signal available as features.
Methods: Fetal heart recordings from 7 pregnant women in the 3rd trimester of pregnancy were taken in parallel with a measurement microphone and a portable Doppler device. Signal peak positions from the Doppler device were taken as the locations of S1 heart sounds and subsequently used as classification labels for the microphone signal. After a moving-window approach was applied for segmentation, more than 7600 observations were stored in the final dataset. The 135 extracted features consisted of typical audio temporal and spectral characteristics, each taken from separate sets of audio signals and intrinsic mode functions. We used a number of metrics and methods to validate the usability of the features, including univariate analysis of feature ranking and importance. Furthermore, we trained a number of machine learning classifiers to validate the usability of features based on intrinsic mode functions, taking prediction accuracy as the comparison metric.
Results: Features extracted from intrinsic mode functions, combined with audio features, significantly improve accuracy in comparison to using only audio features. The improvements in detection accuracy obtained with a selected set of combined features ranged from 3.8% to 10.3%, depending on the classifier employed.
Conclusions: We have utilized empirical mode decomposition as a method of extracting features relevant for fetal heartbeat classification. The results show consistent improvements in detection accuracy when these characteristics are added to a set of conventional audio features. This implies substantial benefits of applying empirical mode decomposition and lays the groundwork for future research on fetal heartbeat detection.
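The moving-window segmentation with Doppler-derived labels described in the methods can be sketched as follows. The abstract does not give the window length, the hop size, or the exact labeling rule, so everything here is an assumption for illustration: the hypothetical function `segment_with_labels` marks a window as positive when at least one reference peak index falls inside it.

```python
def segment_with_labels(signal, peak_positions, window, hop):
    """Slide a fixed-length window over the signal and label each segment.

    A segment is labeled 1 if any reference peak index falls inside it,
    otherwise 0. This labeling rule is an assumption for illustration,
    not the rule used in the study.
    """
    segments, labels = [], []
    for start in range(0, len(signal) - window + 1, hop):
        end = start + window
        has_peak = any(start <= p < end for p in peak_positions)
        segments.append(signal[start:end])
        labels.append(1 if has_peak else 0)
    return segments, labels


# Hypothetical toy data (assumption): 20 samples standing in for the
# microphone signal, with reference peaks at sample indices 3 and 14.
signal = list(range(20))
segments, labels = segment_with_labels(signal, [3, 14], window=4, hop=4)
```

Each labeled segment would then be passed to feature extraction. Overlapping windows (a hop smaller than the window) yield more observations from the same recording, at the cost of correlated samples.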

Articles published on Set Of Audio Features

Emergency Vehicle Classification Using Combined Temporal and Spectral Audio Features with Machine Learning Algorithms

Fall detection from audios with Audio Transformers

Singer Identification by Vocal Parts Detection and Singer Classification Using LSTM Neural Networks

COVID-19 Diagnosis from Crowdsourced Cough Sound Data

Audiovisual speaker indexing for Web-TV automations

Can empirical mode decomposition improve heartbeat detection in fetal phonocardiography signals?

Clustering analysis of crowd noise from collegiate basketball games

Prediction of three articulatory categories in vocal sound imitations using models for auditory receptive fields.

A Combined Motion-Audio School Bullying Detection Algorithm

Improved Audio Steganalytic Feature and Its Applications in Audio Forensics

Detecting Parkinson's disease from sustained phonation and speech signals.

Modeling Timbre Similarity of Short Music Clips.

Efficient audio-driven multimedia indexing through similarity-based speech / music discrimination

TRADITIONAL MALAYSIAN MUSICAL GENRES CLASSIFICATION BASED ON THE ANALYSIS OF BEAT FEATURE IN AUDIO

Quantitative Study of Music Listening Behavior in a Smartphone Context

LIRIS-ACCEDE: A Video Database for Affective Content Analysis

Probabilistic Detection Methods for Acoustic Surveillance Using Audio Histograms

Drumkit simulator from everyday desktop objects

Combining Language Modeling and LSA on Greek Song “Words” for Mood Classification

On Computer-Assisted Orchestration
