MMHFNet: Multi-modal and multi-layer hybrid fusion network for voice pathology detection

Hussein M.A Mohammed,Asli Nur Omeroglu,Emin Argun Oral

doi:10.1016/j.eswa.2023.119790

Abstract

Automatic voice pathology detection using non-invasive techniques that utilize patients’ speech and electroglottograph (EGG) signals play a vital role in diagnosis and early medical intervention. In this paper, a novel deep Multi-Modal and Multi-Layer Hybrid Fusion Network (MMHFNet) is proposed to improve the performance of non-invasive voice pathology detection systems. MMHFNet simultaneously incorporates complementary information of different modalities (speech and EGG signals). It also vertically combines the low-level features, extracted from shallow layers, and high-level features, extracted from deep layers, to take the full advantage of spatio-spectral information of different layers for multi-layer fusion. The features extracted by MMHFNet are then fed into an LSTM classification network to diagnose the voice pathology. Comprehensive experiments are conducted on the publicly available Saarbruecken Voice Database (SVD) to evaluate the performance of the proposed MMHFNet. This dataset is used in two manners; one using its all samples and the other with selected samples to form the largest balanced SVD dataset. Experimental results demonstrated that the proposed MMHFNet achieves accuracy rates of 91% and 96.05% for datasets with all and balanced samples, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MMHFNet: Multi-modal and multi-layer hybrid fusion network for voice pathology detection

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications

Lead the way for us

Journal: Expert Systems with Applications	Publication Date: Mar 14, 2023
Citations: 10

Similar Papers

Multi-modal voice pathology detection architecture based on deep and handcrafted feature fusion
Asli Nur Omeroglu ... Emin Argun Oral
Engineering Science and Technology, an International Journal | VOL. 36
Asli Nur Omeroglu, et. al.Asli Nur Omeroglu ... Emin Argun Oral
01 Apr 2022
Engineering Science and Technology, an International Journal | VOL. 36

Deep Learning Based Pathological Voice Detection Algorithm Using Speech and Electroglottographic (EGG) Signals
Rumana Islam ... Esam Abdel-Raheem
-
Rumana Islam, et. al.Rumana Islam ... Esam Abdel-Raheem
23 Nov 2022
23 Nov 2022

Voice pathology detection using convolutional neural networks with electroglottographic (EGG) and speech signals
Rumana Islam ... Mohammed Tarique
Computer Methods and Programs in Biomedicine Update | VOL. 2
Rumana Islam, et. al.Rumana Islam ... Mohammed Tarique
01 Jan 2021
Computer Methods and Programs in Biomedicine Update | VOL. 2

Voice pathology detection and classification from speech signals and EGG signals based on a multimodal fusion method.
Lei Geng ... Wei Wang
Biomedizinische Technik. Biomedical engineering | VOL. 66
Lei Geng, et. al.Lei Geng ... Wei Wang
29 Nov 2021
Biomedizinische Technik. Biomedical engineering | VOL. 66

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MMHFNet: Multi-modal and multi-layer hybrid fusion network for voice pathology detection

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications