High performance text-independent speaker recognition system based on voiced/unvoiced segmentation and multiple neural nets

Nikos Fakotakis, George Kokkinakis, John Sirigos

doi:10.21437/eurospeech.1999-239

Abstract

This paper presents a text-independent speaker recognition system based on the voiced segments of the speech signal. The proposed system uses feedforward MLP classification, requires only a limited amount of training and testing data, and achieves comparatively high accuracy. The techniques employed are RASTA-PLP speech analysis for parameter estimation, a feedforward MLP for voiced/unvoiced segmentation, and one simple MLP per speaker for classification. The system was trained and tested on the TIMIT and NTIMIT databases. The verification experiments yielded high accuracy: above 99% for clean speech (TIMIT) and 74.7% for noisy speech (NTIMIT). Additional experiments compared the proposed use of voiced segments against using only vowels and against using all phonetic categories; the results favored voiced segments.
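The pipeline outlined in the abstract (frame-level features, a voiced/unvoiced classifier, then one model per speaker) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the plain callables stand in for the trained MLPs, the two-element tuples stand in for RASTA-PLP feature vectors, and averaging per-frame scores with a fixed threshold is one plausible decision rule among several. The names `select_voiced`, `speaker_scores`, `identify`, and `verify` are hypothetical.

```python
from typing import Callable, Dict, List, Sequence

Frame = Sequence[float]            # one feature vector (stand-in for RASTA-PLP coefficients)
Scorer = Callable[[Frame], float]  # stand-in for a trained per-speaker MLP, output in [0, 1]


def select_voiced(frames: List[Frame], is_voiced: Callable[[Frame], bool]) -> List[Frame]:
    """Keep only the frames that the voiced/unvoiced classifier labels as voiced."""
    return [f for f in frames if is_voiced(f)]


def speaker_scores(frames: List[Frame], models: Dict[str, Scorer]) -> Dict[str, float]:
    """Average each speaker model's output over the frames (assumed combination rule)."""
    if not frames:
        raise ValueError("no voiced frames to score")
    return {name: sum(model(f) for f in frames) / len(frames)
            for name, model in models.items()}


def identify(frames: List[Frame], is_voiced: Callable[[Frame], bool],
             models: Dict[str, Scorer]) -> str:
    """Identification: return the speaker whose model gives the highest average score."""
    scores = speaker_scores(select_voiced(frames, is_voiced), models)
    return max(scores, key=scores.get)


def verify(frames: List[Frame], is_voiced: Callable[[Frame], bool],
           models: Dict[str, Scorer], claimed: str, threshold: float = 0.5) -> bool:
    """Verification: accept the claimed identity if its average score reaches the threshold."""
    scores = speaker_scores(select_voiced(frames, is_voiced), models)
    return scores[claimed] >= threshold


# Toy data: the first element acts as a voicing cue, the second as a speaker cue.
toy_frames = [(0.9, 1.0), (0.8, 0.5), (0.1, -3.0), (0.7, -0.2)]
toy_is_voiced = lambda f: f[0] > 0.5          # hypothetical voiced/unvoiced decision
toy_models = {
    "spk_a": lambda f: 1.0 if f[1] > 0 else 0.0,   # hypothetical speaker model
    "spk_b": lambda f: 1.0 if f[1] <= 0 else 0.0,  # hypothetical speaker model
}

best = identify(toy_frames, toy_is_voiced, toy_models)
accept_a = verify(toy_frames, toy_is_voiced, toy_models, "spk_a")
accept_b = verify(toy_frames, toy_is_voiced, toy_models, "spk_b")
```

In this toy run the third frame is discarded as unvoiced before any speaker model sees it, which mirrors the idea of scoring only voiced segments. Keeping one small model per speaker also means that enrolling a new speaker would add a model rather than require retraining a single large classifier, although the abstract does not state that this was the motivation.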

Similar Papers

Large population speaker recognition using wideband and telephone speech
Douglas A Reynolds
25 Oct 1994

Spectral amplitude nonlinearities for improved noise robustness of spectral features for use in automatic speech recognition
Stephen Zahorian ... Brian Wong
The Journal of the Acoustical Society of America | VOL. 130
01 Oct 2011

Speaker identification and verification using Gaussian mixture speaker models
Douglas A Reynolds
Speech Communication | VOL. 17
01 Aug 1995

The effects of telephone transmission degradations on speaker recognition performance
D.A Reynolds ... T.F Quatieri
09 May 1995
