Employing Second-Order Circular Suprasegmental Hidden Markov Models to Enhance Speaker Identification Performance in Shouted Talking Environments

Ismail Shahin

doi:10.1155/2010/862138

Abstract

Speaker identification performance is almost perfect in neutral talking environments; however, the performance is deteriorated significantly in shouted talking environments. This work is devoted to proposing, implementing and evaluating new models called Second-Order Circular Suprasegmental Hidden Markov Models (CSPHMM2s) to alleviate the deteriorated performance in the shouted talking environments. These proposed models possess the characteristics of both Circular Suprasegmental Hidden Markov Models (CSPHMMs) and Second-Order Suprasegmental Hidden Markov Models (SPHMM2s). The results of this work show that CSPHMM2s outperform each of: First-Order Left-to-Right Suprasegmental Hidden Markov Models (LTRSPHMM1s), Second-Order Left-to-Right Suprasegmental Hidden Markov Models (LTRSPHMM2s) and First-Order Circular Suprasegmental Hidden Markov Models (CSPHMM1s) in the shouted talking environments. In such talking environments and using our collected speech database, average speaker identification performance based on LTRSPHMM1s, LTRSPHMM2s, CSPHMM1s and CSPHMM2s is 74.6%, 78.4%, 78.7% and 83.4%, respectively. Speaker identification performance obtained based on CSPHMM2s is close to that obtained based on subjective assessment by human listeners.

Highlights

Speaker recognition is the process of automatically recognizing who is speaking on the basis of individual information embedded in speech signals
To evaluate the proposed models, speaker identification performance based on such models is compared separately with that based on each of LTRSPHMM1s, LTRSPHMM2s, and CSPHMM1s in the two talking environments
It is evident from this table that each of LTRSPHMM1s, LTRSPHMM2s, CSPHMM1s, and CSPHMM2s perform almost perfect in the neutral talking environments

Summary

Introduction

Speaker recognition is the process of automatically recognizing who is speaking on the basis of individual information embedded in speech signals. Speaker recognition involves two applications: speaker identification and speaker verification (authentication). Speaker identification is the process of finding the identity of the unknown speaker by comparing his/her voice with voices of registered speakers in the database. Speaker identification can be used in criminal investigations to determine the suspected persons who generated the voice recorded at the scene of the crime. Speaker identification can be used in civil cases or for the media. These cases include calls to radio stations, local or other government authorities, insurance companies, monitoring people by their voices, and many other applications

Objectives

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURASIP Journal on Audio, Speech, and Music Processing	Publication Date: Jan 1, 2010
Citations: 41	License type: cc-by

R Discovery Prime

R Discovery Prime

Employing Second-Order Circular Suprasegmental Hidden Markov Models to Enhance Speaker Identification Performance in Shouted Talking Environments

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Audio, Speech, and Music Processing

Lead the way for us

Similar Papers

Speaker Identification in a Shouted Talking Environment Based on Novel Third-Order Circular Suprasegmental Hidden Markov Models
Ismail M A Shahin
Circuits, Systems, and Signal Processing | VOL. 35
Ismail M A ShahinIsmail M A Shahin
30 Dec 2015
Circuits, Systems, and Signal Processing | VOL. 35

Speaker identification in emotional talking environments based on CSPHMM2s
Ismail Shahin
Engineering Applications of Artificial Intelligence | VOL. 26
Ismail ShahinIsmail Shahin
26 Apr 2013
Engineering Applications of Artificial Intelligence | VOL. 26

Emirati-accented speaker identification in each of neutral and shouted talking environments
Ismail Shahin ... Mohammed Bahutair
International Journal of Speech Technology | VOL. 21
Ismail Shahin, et. al.Ismail Shahin ... Mohammed Bahutair
28 Mar 2018
International Journal of Speech Technology | VOL. 21

Talking condition recognition in stressful and emotional talking environments based on CSPHMM2s
Ismail Shahin ... Mohammed Nasser Ba-Hutair
International Journal of Speech Technology | VOL. 18
Ismail Shahin, et. al.Ismail Shahin ... Mohammed Nasser Ba-Hutair
31 Aug 2014
International Journal of Speech Technology | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Employing Second-Order Circular Suprasegmental Hidden Markov Models to Enhance Speaker Identification Performance in Shouted Talking Environments

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Audio, Speech, and Music Processing