Analysis and recognition of whispered speech

Taisuke Ito, Kazuya Takeda, Fumitada Itakura

doi:10.1016/j.specom.2003.10.005

Abstract

This study examines the acoustic characteristics of whispered speech and addresses some of the issues in recognizing whispered speech used for communication over a mobile phone in a noisy environment. The acoustic analysis shows that vowel formant frequencies shift upward in whispered speech relative to normal speech. Voiced consonants in whispered speech have lower energy at low frequencies, up to 1.5 kHz, and greater spectral flatness than in normal speech. In the recognition experiments, adapting whispered speech models with a small amount of whispered speech data from the target speaker proved effective for recognizing whispered speech. In a noisy environment, recognition accuracy decreases significantly for whispered speech compared with the same utterances spoken normally. Covering the mouth with a hand to increase the SNR gives higher recognition accuracy for whispered speech, which is frequently used for private communication in noisy environments.
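The quantities named in the abstract (spectral flatness, low-frequency energy up to 1.5 kHz, and SNR) can be illustrated with a minimal sketch. This is not the authors' analysis code: the definitions below are the common textbook ones (flatness as the ratio of geometric to arithmetic mean of the power spectrum, SNR as a power ratio in dB), and the function names, the 8 kHz sample rate, the 256-sample frame, and the synthetic tone and noise frames are all assumptions chosen only for illustration. A pure tone stands in for a strongly harmonic (voiced-like) frame and white noise for a noise-like (whisper-like) frame; neither is real speech.

```python
import cmath
import math
import random

def power_spectrum(frame):
    """Power spectrum of a real frame via a naive DFT (bins 0..N/2)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum

def spectral_flatness(power, eps=1e-12):
    """Geometric mean divided by arithmetic mean of the power spectrum."""
    log_mean = sum(math.log(p + eps) for p in power) / len(power)
    arith_mean = sum(power) / len(power) + eps
    return math.exp(log_mean) / arith_mean

def low_band_energy_ratio(power, sample_rate, n_fft, cutoff_hz=1500.0):
    """Fraction of total energy in bins at or below cutoff_hz."""
    bin_hz = sample_rate / n_fft
    low = sum(p for k, p in enumerate(power) if k * bin_hz <= cutoff_hz)
    return low / (sum(power) + 1e-12)

def snr_db(signal_power, noise_power):
    """Signal-to-noise ratio in decibels from two power values."""
    return 10.0 * math.log10(signal_power / noise_power)

# Assumed illustration parameters (not taken from the paper).
SAMPLE_RATE = 8000
N_FFT = 256

# Synthetic stand-ins: a 500 Hz tone and seeded white Gaussian noise.
tone = [math.sin(2 * math.pi * 500 * t / SAMPLE_RATE) for t in range(N_FFT)]
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(N_FFT)]

tone_power = power_spectrum(tone)
noise_power = power_spectrum(noise)

tone_flatness = spectral_flatness(tone_power)
noise_flatness = spectral_flatness(noise_power)
tone_low_ratio = low_band_energy_ratio(tone_power, SAMPLE_RATE, N_FFT)
noise_low_ratio = low_band_energy_ratio(noise_power, SAMPLE_RATE, N_FFT)

print("flatness  tone:", tone_flatness, " noise:", noise_flatness)
print("low-band  tone:", tone_low_ratio, " noise:", noise_low_ratio)
```

In this toy setup the noise frame has a much flatter spectrum and a smaller share of its energy below 1.5 kHz than the tone, which is the direction of the difference the abstract reports for whispered versus normal voiced consonants. The magnitudes here say nothing about real speech.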

Journal: Speech Communication
Publication Date: Sep 11, 2004
Citations: 163
