Zero-crossing Rate Research Articles

Speech emotion recognition (SER) is a technology that can be applied to distance education to analyze speech patterns and evaluate speakers’ emotional states in real time. It provides valuable insights and can be used to enhance students’ learning experiences by enabling the assessment of their instructors’ emotional stability, a factor that significantly impacts the effectiveness of information delivery. Students demonstrate different engagement levels during learning activities, and assessing this engagement is important for controlling the learning process and improving e-learning systems. An important aspect that may influence student engagement is their instructors’ emotional state. Accordingly, this study used deep learning techniques to create an automated system for recognizing instructors’ emotions in their speech when delivering distance learning. This methodology entailed integrating transformer, convolutional neural network, and long short-term memory architectures into an ensemble to enhance the SER. Feature extraction from audio data used Mel-frequency cepstral coefficients; chroma; a Mel spectrogram; the zero-crossing rate; spectral contrast, centroid, bandwidth, and roll-off; and the root-mean square, with subsequent optimization processes such as adding noise, conducting time stretching, and shifting the audio data. Several transformer blocks were incorporated, and a multi-head self-attention mechanism was employed to identify the relationships between the input sequence segments. The preprocessing and data augmentation methodologies significantly enhanced the precision of the results, with accuracy rates of 96.3%, 99.86%, 96.5%, and 85.3% for the Ryerson Audio–Visual Database of Emotional Speech and Song, Berlin Database of Emotional Speech, Surrey Audio–Visual Expressed Emotion, and Interactive Emotional Dyadic Motion Capture datasets, respectively. Furthermore, it achieved 83% accuracy on another dataset created for this study, the Saudi Higher-Education Instructor Emotions dataset. The results demonstrate the considerable accuracy of this model in detecting emotions in speech data across different languages and datasets.

Read full abstract

The Internet of Bio-Nano Things concept (IoBNT) emerged from the need to establish connections between biological nanomachines, the intra-body nanonetwork, and the cyber internet to facilitate information exchange. While extensive research has concentrated on optimizing communication efficiency among nanodevices within networks, challenges such as IoBNT security and the interface linking nanonetwork to the internet have remained unaddressed. Consequently, this study introduces a privacy scheme designed to operate atop the Physical Cyber Interface (pHCI) within the IoBNT framework. Our proposed chaotic system derives its foundation from the command signals issued by medical personnel to pHCI devices implanted within the human body. It employs a concealed version of features generated through a Modified Quadratic Map (MQM) to enhance the privacy of patient information and to ensure a precise dosage release. Additionally, our scheme incorporates Binary Phase Shifting Key (BPSK) modulation through the incorporation of a carrier wave, along with feature extraction with zero-crossing rates. This privacy scheme significantly amplifies the key space, thereby guaranteeing an accurate right dose release with the protection of patient privacy. To assess the performance of our proposed scheme, we evaluate its operation on top of the pHCI device using various performance metrics. Subsequently, we study its performance by employing multi-compartmental models in both the forward and reverse pHCI directions of the IoBNT paradigm. The results from our simulation model clearly illustrate that the IoBNT-based privacy scheme has potential to enhance the delivery of therapeutic drugs to target cells while effectively addressing privacy concerns. An evaluation of performance metrics for two binary codes (thermal and light) reveals sensitivity and specificity rates of 95.333% and 95%, 100%, and 100%, respectively. Furthermore, the performance of our proposed privacy scheme, as measured by EER, accuracy, NPV, and PPV, has proven to be highly satisfactory. Hence, our proposed scheme makes significant role in enhancing the security of the physical cyber interface device while remaining cost-effective, and ensuring the safety of patients' life and confidentiality.

Read full abstract

Zero-crossing Rate Research Articles

Related Topics

Articles published on Zero-crossing Rate

Emergency Vehicle Classification Using Combined Temporal and Spectral Audio Features with Machine Learning Algorithms

Toward Wearables for Bruxism Detection: Voluntary Oral Behaviors Sound Recorded Across the Head Depend on Transducer Placement.

A Novel Short-Term PM2.5 Forecasting Approach Using Secondary Decomposition and a Hybrid Deep Learning Model

Multifeature Fusion Method with Metaheuristic Optimization for Automated Voice Pathology Detection

Novel combustion instability diagnosis method in a hydrogen/natural gas co-firing gas turbine combustor using a combination of four criteria: Temporal kurtosis, permutation entropy, energy of entropy, and zero-crossing rate

CNC Mechanical Machine and Musical Sound Analysis of Zero Crossing Rates (ZCR) by Artificial Intelligence Based Tools.

Speech emotion recognition based on multi-feature speed rate and LSTM

Failure analysis in predictive maintenance: Belt drive diagnostics with expert systems and Taguchi method for unconventional vibration features

The Impact of Contact Force on Signal Quality Indices in Photoplethysmography Measurements

Assessment of Pepper Robot's Speech Recognition System through the Lens of Machine Learning.

Combining Transformer, Convolutional Neural Network, and Long Short-Term Memory Architectures: A Novel Ensemble Learning Technique That Leverages Multi-Acoustic Features for Speech Emotion Recognition in Distance Education Classrooms

Feature fusion method for pulmonary tuberculosis patient detection based on cough sound.

Marathi Speech Emotion recognition using Deep Learning techniques.

Deep Learning for Arabic Speech Recognition Using Convolutional Neural Networks

Sex identification of ducklings based on acoustic signals

Detecting wire breaks in prestressed concrete pipes: an easy-to-install distributed fibre acoustic sensing approach

Internet of Bio-NanoThings privacy: securing a multi compartmental targeted cancer drug delivery scheme

Effective modelling of human expressive states from voice by adaptively tuning the neuro-fuzzy inference system

Machine learning-based infant crying interpretation.

Speech Emotion Recognition Model Using Deep Learning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Zero-crossing Rate Research Articles

Related Topics

Articles published on Zero-crossing Rate

Emergency Vehicle Classification Using Combined Temporal and Spectral Audio Features with Machine Learning Algorithms

Toward Wearables for Bruxism Detection: Voluntary Oral Behaviors Sound Recorded Across the Head Depend on Transducer Placement.

A Novel Short-Term PM2.5 Forecasting Approach Using Secondary Decomposition and a Hybrid Deep Learning Model

Multifeature Fusion Method with Metaheuristic Optimization for Automated Voice Pathology Detection

Novel combustion instability diagnosis method in a hydrogen/natural gas co-firing gas turbine combustor using a combination of four criteria: Temporal kurtosis, permutation entropy, energy of entropy, and zero-crossing rate

CNC Mechanical Machine and Musical Sound Analysis of Zero Crossing Rates (ZCR) by Artificial Intelligence Based Tools.

Speech emotion recognition based on multi-feature speed rate and LSTM

Failure analysis in predictive maintenance: Belt drive diagnostics with expert systems and Taguchi method for unconventional vibration features

The Impact of Contact Force on Signal Quality Indices in Photoplethysmography Measurements

Assessment of Pepper Robot's Speech Recognition System through the Lens of Machine Learning.

Combining Transformer, Convolutional Neural Network, and Long Short-Term Memory Architectures: A Novel Ensemble Learning Technique That Leverages Multi-Acoustic Features for Speech Emotion Recognition in Distance Education Classrooms

Feature fusion method for pulmonary tuberculosis patient detection based on cough sound.

Marathi Speech Emotion recognition using Deep Learning techniques.

Deep Learning for Arabic Speech Recognition Using Convolutional Neural Networks

Sex identification of ducklings based on acoustic signals

Detecting wire breaks in prestressed concrete pipes: an easy-to-install distributed fibre acoustic sensing approach

Internet of Bio-NanoThings privacy: securing a multi compartmental targeted cancer drug delivery scheme

Effective modelling of human expressive states from voice by adaptively tuning the neuro-fuzzy inference system

Machine learning-based infant crying interpretation.

Speech Emotion Recognition Model Using Deep Learning