Glottal instants extraction from speech signal using Deep Feature Loss

Supritha M Shetty, K T Deepak, Suraj Durgesht

doi:10.1109/spcom55316.2022.9840808

Abstract

The electroglottograph (EGG) is a device that measures the conductance between the vocal folds. Analysis of the EGG signal has many applications in the literature, such as text-to-speech synthesis, voice disorder analysis, emotion recognition, and speaker verification, so an EGG device is normally required to record vocal fold activity. As an alternative, this work proposes a new method that synthesizes the EGG waveform from the speech signal using a context aggregation convolutional neural network. The synthesis network is trained with deep feature losses computed using a second network, the EGG classification network. The synthesized EGG signal then needs to be characterized. During voiced speech production, the instants at which the vocal folds attain complete closure are called glottal closure instants (GCIs); likewise, the instants at which they open are called glottal opening instants (GOIs). Such instants can be measured reliably from the EGG signal. The CMU-Arctic database, a parallel corpus of simultaneously recorded speech and EGG signals, is used for training the synthesis network and for comparison purposes. The proposed method is compared with other state-of-the-art techniques, and the extraction of glottal instants from the synthesized EGG signals is found to perform comparably to those methods.
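
To make the idea of a deep feature loss more concrete, the following is a minimal sketch, not the authors' implementation. It assumes the loss is a weighted sum of mean absolute differences between the layer activations that a fixed loss network produces for the synthesized and the reference EGG signals. The toy loss network (dilated 1-D convolutions with ReLU), the placeholder kernels in `KERNELS`, and the function names `dilated_conv1d`, `loss_network_features`, and `deep_feature_loss` are all hypothetical; in the paper this role is played by the trained EGG classification network, whose architecture is not described in the abstract.

```python
import numpy as np


def dilated_conv1d(x, kernel, dilation):
    """Same-length 1-D convolution with a dilated odd-length kernel, then ReLU."""
    pad = dilation * (len(kernel) - 1) // 2
    xp = np.pad(x, pad)
    out = np.zeros(len(x), dtype=float)
    for i, w in enumerate(kernel):
        out += w * xp[i * dilation : i * dilation + len(x)]
    return np.maximum(out, 0.0)


def loss_network_features(signal, kernels):
    """Activations of every layer of a toy, fixed loss network (assumption)."""
    feats = []
    h = np.asarray(signal, dtype=float)
    for layer, kernel in enumerate(kernels):
        # Dilation doubles per layer, a common choice for context aggregation.
        h = dilated_conv1d(h, kernel, dilation=2 ** layer)
        feats.append(h)
    return feats


def deep_feature_loss(synth, reference, kernels, weights=None):
    """Weighted sum over layers of the mean absolute feature difference."""
    fs = loss_network_features(synth, kernels)
    fr = loss_network_features(reference, kernels)
    if weights is None:
        weights = np.ones(len(kernels))
    return float(sum(w * np.mean(np.abs(a - b))
                     for w, a, b in zip(weights, fs, fr)))


# Placeholder kernels standing in for a trained EGG classification network.
KERNELS = [np.array([1.0, 0.0, -1.0]),
           np.array([0.5, 1.0, 0.5]),
           np.array([-1.0, 2.0, -1.0])]

rng = np.random.default_rng(0)
t = np.arange(800) / 16000.0                  # 50 ms at an assumed 16 kHz rate
reference = np.sin(2 * np.pi * 120.0 * t)     # stand-in for a reference EGG
close = reference + 0.01 * rng.standard_normal(t.size)  # near-perfect synthesis
far = rng.standard_normal(t.size)                        # unrelated signal

loss_close = deep_feature_loss(close, reference, KERNELS)
loss_far = deep_feature_loss(far, reference, KERNELS)
```

In this sketch `loss_close` is smaller than `loss_far`, and the loss of a signal against itself is zero. In an actual training setup the loss would be computed in a framework with automatic differentiation, so that its gradient can update the synthesis network while the loss network stays fixed.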
