Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling

Amarnag Subramanya,Zhengyou Zhang,Zicheng Liu,Alex Acero

doi:10.1016/j.specom.2007.09.002

Amarnag Subramanya, Zhengyou Zhang + Show 2 more

https://doi.org/10.1016/j.specom.2007.09.002

Copy DOI

Abstract

In this paper, we tackle the problem of speech enhancement from two fronts: speech modeling and multisensory input. We present a new speech model based on statistics of magnitude-normalized complex spectra of speech signals. By performing magnitude normalization, we are able to get rid of huge intra- and inter-speaker variation in speech energy and to build a better speech model with a smaller number of Gaussian components. To deal with real-world problems with multiple noise sources, we propose to use multiple heterogeneous sensors, and in particular, we have developed microphone headsets that combine a conventional air microphone and a bone sensor. The bone sensor makes direct contact with the speaker’s temple (area behind the ear), and captures the vibrations of the bones and skin during the process of vocalization. The signals captured by the bone microphone, though distorted, contain useful audio information, especially in the low frequency range, and more importantly, they are very robust to external noise sources (stationary or not). By fusing the bone channel signals with the air microphone signals, much improved speech signals have been obtained.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: Sep 25, 2007
Citations: 52

Similar Papers

Speech Modelingwith Magnitude-Normalized Complex Spectra and Its Application to Multisensory Speech Enhancement
Amarnag Subramanya ... Alex Acero
-
Amarnag Subramanya, et. al.Amarnag Subramanya ... Alex Acero
01 Sep 2005
01 Sep 2005

Speech Enhancement and Recognition of Compressed Speech Signal in Noisy Reverberant Conditions
Maloji Suman ... M Madhavi Latha
-
Maloji Suman, et. al.Maloji Suman ... M Madhavi Latha
01 Jan 2012
01 Jan 2012

Spectrogram-based speech enhancement by spatial attention generative adversarial networks
Haixin Luo ... Yu Fu
-
Haixin Luo, et. al.Haixin Luo ... Yu Fu
12 Oct 2022
12 Oct 2022

The system of speech enhancement algorithm for blind source separation based on FastICA
Minsan Zhang
-
Minsan ZhangMinsan Zhang
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling

Abstract

Talk to us

Similar Papers

More From: Speech Communication