Abstract

This paper investigates the influence of GSM speech coding on the performance of a text independent Speaker Recognition System (SRS). The SRS developed perform recognition on reconstructed speech waveform from the coded parameters using Gaussian Mixture Models (GMM) technique. The performance evaluation due to the use of the GSM speech coding namely the GSMEFR (Global System Mobile Enhanced Full Rate) codec was conducted, using three transcoded databases, obtained by passing the local ARADIGIT database through the GSM coder/decoder. The recognition evaluation was also conducted using original ARADIGIT sampled at 16 KHz and its 8 KHz downsampled version. The ARADIGIT database consists of 60 speakers (31 male speakers and 29 female speakers) pronouncing the ten Arabic digits five time each. Several experiments were conducted in order to evaluate the degradation introduced by different aspects of the simulated coder.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call