Detecting Lombard Speech Using Deep Learning Approach.

Krzysztof Kąkol,Gintautas Tamulevičius,Bożena Kostek,Gražina Korvel

doi:10.3390/s23010315

Krzysztof Kąkol, Gintautas Tamulevičius + Show 2 more

Open Access

https://doi.org/10.3390/s23010315

Copy DOI

Abstract

Robust Lombard speech-in-noise detecting is challenging. This study proposes a strategy to detect Lombard speech using a machine learning approach for applications such as public address systems that work in near real time. The paper starts with the background concerning the Lombard effect. Then, assumptions of the work performed for Lombard speech detection are outlined. The framework proposed combines convolutional neural networks (CNNs) and various two-dimensional (2D) speech signal representations. To reduce the computational cost and not resign from the 2D representation-based approach, a strategy for threshold-based averaging of the Lombard effect detection results is introduced. The pseudocode of the averaging process is also included. A series of experiments are performed to determine the most effective network structure and the 2D speech signal representation. Investigations are carried out on German and Polish recordings containing Lombard speech. All 2D signal speech representations are tested with and without augmentation. Augmentation means using the alpha channel to store additional data: gender of the speaker, F0 frequency, and first two MFCCs. The experimental results show that Lombard and neutral speech recordings can clearly be discerned, which is done with high detection accuracy. It is also demonstrated that the proposed speech detection process is capable of working in near real-time. These are the key contributions of this work.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Sensors (Basel, Switzerland)	Publication Date: Dec 28, 2022
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Detecting Lombard Speech Using Deep Learning Approach.

Abstract

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)

Lead the way for us

Similar Papers

The Lombard Effect as a Communicative Phenomenon
Priscilla Lau
UC Berkeley Phonology Lab Annual Reports | VOL. 4
Priscilla LauPriscilla Lau
01 Jan 2008
UC Berkeley Phonology Lab Annual Reports | VOL. 4

Analysis of lombard and angry speech using Gaussian Mixture Models and KL divergence
Shubham Mittal ... Swati Vyas
-
Shubham Mittal, et. al.Shubham Mittal ... Swati Vyas
01 Feb 2013
01 Feb 2013

Understanding Lombard speech: a review of compensation techniques towards improving speech based recognition systems
S Uma Maheswari ... A Nayeemulla Khan
Artificial Intelligence Review | VOL. 54
S Uma Maheswari, et. al.S Uma Maheswari ... A Nayeemulla Khan
18 Sep 2020
Artificial Intelligence Review | VOL. 54

Identifying Issues in Estimating Parameters from Speech Under Lombard Effect
M Aiswarya ... D Govind
-
M Aiswarya, et. al.M Aiswarya ... D Govind
27 Sep 2017
27 Sep 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detecting Lombard Speech Using Deep Learning Approach.

Abstract

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)