Text-independent Speaker Identification Research Articles

A method and apparatus for securing access to a service or facility employing automatic speech recognition, text-independent speaker identification, natural language understanding techniques and additional dynamic and static features. The method includes the steps of receiving and decoding speech containing indicia of the speaker such as a name, address or customer number; accessing a database containing information on candidate speakers; questioning the speaker based on the information; receiving, decoding and verifying an answer to the question; obtaining a voice sample of the speaker and verifying the voice sample against a model; generating a score based on the answer and the voice sample; and granting access if the score is equal to or greater than a threshold. Alternatively, the method includes the steps of receiving and decoding speech containing indicia of the speaker; generating a sub-list of speaker candidates having indicia substantially matching the speaker; activating databases containing information about the speaker candidates in the sub-list; performing voice classification analysis; eliminating speaker candidates based on the voice classification analysis; questioning the speaker regarding the information; eliminating speaker candidates based on the answer; and iteratively repeating prior steps until one speaker candidate (in which case the speaker is granted access), or no speaker candidate remains (in which case the speaker is not granted access).

This paper addresses the issue of closed-set text-independent speaker identification from samples of speech recorded over the telephone. It focuses on the effects of acoustic mismatches between training and testing data, and concentrates on two approaches: (1) extracting features that are robust against channel variations and (2) transforming the speaker models to compensate for channel effects. First, an experimental study shows that optimizing the front end processing of the speech signal can significantly improve speaker recognition performance. A new filterbank design is introduced to improve the robustness of the speech spectrum computation in the front-end unit. Next, a new feature based on spectral slopes is described. Its ability to discriminate between speakers is shown to be superior to that of the traditional cepstrum. This feature can be used alone or combined with the cepstrum. The second part of the paper presents two model transformation methods that further reduce channel effects. These methods make use of a locally collected stereo database to estimate a speaker-independent variance transformation for each speech feature used by the classifier. The transformations constructed on this stereo database can then be applied to speaker models derived from other databases. Combined, the methods developed in this paper resulted in a 38% relative improvement on the closed-set 30-s training 5-s testing condition of the NIST'95 Evaluation task, after cepstral mean removal.

Text-independent Speaker Identification Research Articles

Related Topics

Articles published on Text-independent Speaker Identification

Codebook design using genetic algorithm and its application to speaker identification

A Model-Selection-Based Self-Splitting Gaussian Mixture Learning with Application to Speaker Identification

Pitch Correlogram Clustering for Fast Speaker Identification

On the Determination of Optimal Model Order for GMM-Based Text-Independent Speaker Identification

Speaker identification using hybrid Karhunen–Loeve transform and Gaussian mixture model approach

Speaker identification by combining multiple classifiers using Dempster–Shafer theory of evidence

Application of time-frequency principal component analysis to text-independent speaker identification

Forensic speaker identification based on spectral moments

Neural networks for improved text-independent speaker identification.

Robust speech features based on wavelet transform with application to speaker identification

Text-independent speaker identification based on explicit exploitation of stochastic characteristics of test utterance

Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases

Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification

Binary quantization of feature vectors for robust text-independent speaker identification

Robust text-independent speaker identification over telephone channels

Speaker identification through use of features selected using genetic algorithm

Methods of Combining Multiple Classifiers with Different Features and Their Applications to Text-Independent Speaker Identification

Second-order statistical measures for text-independent speaker identification

A Modular and Hybrid Connectionist System for Speaker Identification

Large population speaker identification using clean and telephone speech

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Text-independent Speaker Identification Research Articles

Related Topics

Articles published on Text-independent Speaker Identification

Codebook design using genetic algorithm and its application to speaker identification

A Model-Selection-Based Self-Splitting Gaussian Mixture Learning with Application to Speaker Identification

Pitch Correlogram Clustering for Fast Speaker Identification

On the Determination of Optimal Model Order for GMM-Based Text-Independent Speaker Identification

Speaker identification using hybrid Karhunen–Loeve transform and Gaussian mixture model approach

Speaker identification by combining multiple classifiers using Dempster–Shafer theory of evidence

Application of time-frequency principal component analysis to text-independent speaker identification

Forensic speaker identification based on spectral moments

Neural networks for improved text-independent speaker identification.

Robust speech features based on wavelet transform with application to speaker identification

Text-independent speaker identification based on explicit exploitation of stochastic characteristics of test utterance

Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases

Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification

Binary quantization of feature vectors for robust text-independent speaker identification

Robust text-independent speaker identification over telephone channels

Speaker identification through use of features selected using genetic algorithm

Methods of Combining Multiple Classifiers with Different Features and Their Applications to Text-Independent Speaker Identification

Second-order statistical measures for text-independent speaker identification

A Modular and Hybrid Connectionist System for Speaker Identification

Large population speaker identification using clean and telephone speech