Abstract

Speaker identification systems perform well under the neutral talking condition; however, they suffer sharp degradation under the shouted talking condition. In this paper, the second-order hidden Markov models (HMM2s) have been used to improve the recognition performance of isolated-word text-dependent speaker identification systems under the shouted talking condition. Our results show that HMM2s significantly improve the speaker identification performance compared to the first-order hidden Markov models (HMM1s). The average speaker identification performance under the shouted talking condition based on HMM1s is. On the other hand, the average speaker identification performance based on HMM2s is.

Highlights

  • Hidden Markov model (HMM) is one of the most widely used modeling techniques in the fields of speech recognition and speaker recognition [7]

  • Our results show that using HMM2s in the training and testing phases of isolatedword text-dependent speaker identification systems under the shouted talking condition significantly improves the identification performance compared to that using HMM1s

  • This work shows that HMM2s significantly improve the recognition performance of isolated-word text-dependent speaker identification systems under the shouted talking condition

Read more

Summary

MOTIVATION

Stressful talking conditions are defined as talking conditions that cause a speaker to vary his/her production of speech from the neutral talking condition. The neutral talking condition is defined as the talking condition in which speech is produced assuming that the speaker is in a “quiet room” with no task obligations. Some talking conditions are designed to simulate speech produced by different speakers under real stressful talking conditions. Chen used six talking conditions to simulate speech under real stressful talking conditions [4]. These conditions are as follows: neutral, fast, loud, Lombard, soft, and shouted. The shouted talking condition can be defined as follows: when a speaker shouts, his/her object is to produce a very loud acoustic signal to increase either its range (distance) of transmission or its ratio to background noise

INTRODUCTION
BRIEF OVERVIEW OF HIDDEN MARKOV MODELS
SECOND-ORDER HIDDEN MARKOV MODELS
EXTENDED VITERBI AND BAUM-WELCH ALGORITHMS
SPEECH DATABASE
RESULTS
DISCUSSION AND CONCLUSIONS
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call