Abstract

Automatic speaker identification has become a challenging research problem due to its wide range of applications. Neural networks and audio-visual identification systems can be very powerful, but they are limited by the number of speakers: performance drops gradually as more users are registered with the system. This paper proposes a scalable algorithm for real-time text-independent speaker identification based on vowel recognition. Vowel formants are unique across speakers and reflect the vocal tract characteristics of a particular speaker. The contribution of this paper is the design of a scalable system based on vowel formant filters, together with a scoring scheme for classifying an unseen instance. Mel-Frequency Cepstral Coefficients (MFCC) and Linear Predictive Coding (LPC) are both analysed and compared as means of extracting formants from the windowed signal. The extracted formants are filtered against known vowel formant frequencies to isolate the vowel formants for further processing. The formant frequencies of each speaker are collected during the training phase. A test signal is processed in the same way to find its vowel formants, which are compared with the saved vowel formants to identify the speaker of the current signal. A score-based scheme assigns the current signal to the speaker with the highest number of matching formants. The model requires less than 100 bytes of stored data per enrolled speaker and can identify the speaker within a second. Tests conducted on multiple databases show that this score-based scheme outperforms the back propagation neural network and Gaussian mixture models. In general, the longer the speech files, the greater the improvement in accuracy.
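To make the enrolment-and-scoring idea concrete, below is a minimal Python sketch of a score-based vowel formant matcher. It is an illustration under stated assumptions, not the authors' implementation: the vowel F1/F2 ranges, the 50 Hz matching tolerance, and the names (VOWEL_RANGES, keep_vowel_formants, score, identify) are all hypothetical choices.

```python
import numpy as np

# Illustrative F1/F2 ranges (Hz) used to keep only vowel-like formants.
# These bounds are assumptions, not values taken from the paper.
VOWEL_RANGES = {"F1": (250.0, 1000.0), "F2": (600.0, 3000.0)}

def keep_vowel_formants(formants_hz):
    """Filter raw (F1, F2) candidates to those inside the vowel ranges."""
    lo1, hi1 = VOWEL_RANGES["F1"]
    lo2, hi2 = VOWEL_RANGES["F2"]
    return [(f1, f2) for f1, f2 in formants_hz
            if lo1 <= f1 <= hi1 and lo2 <= f2 <= hi2]

def score(test_formants, saved_formants, tolerance_hz=50.0):
    """Count test formant pairs that match some saved pair within tolerance."""
    hits = 0
    for f1, f2 in test_formants:
        for s1, s2 in saved_formants:
            if abs(f1 - s1) <= tolerance_hz and abs(f2 - s2) <= tolerance_hz:
                hits += 1
                break
    return hits

def identify(test_formants, enrolled):
    """Return the enrolled speaker with the highest matching score."""
    test = keep_vowel_formants(test_formants)
    return max(enrolled, key=lambda spk: score(test, enrolled[spk]))

# Usage: each speaker is stored as a handful of (F1, F2) pairs.
enrolled = {"alice": [(310.0, 2200.0), (700.0, 1100.0)],
            "bob":   [(400.0, 1900.0), (650.0, 950.0)]}
print(identify([(305.0, 2210.0)], enrolled))  # -> "alice"
```

Because each speaker is stored only as a small set of formant pairs, the per-speaker footprint stays tiny, which is consistent with the sub-100-byte storage claim above.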

Highlights

  • Speaker Recognition [1] comprises Speaker Identification – identifying the speaker of the current utterance – and Speaker Verification – verifying from the utterance whether the speaker is who he claims to be

  • We compare the score-based scheme proposed in this paper against the Back Propagation Neural Network (BPNN) and the Gaussian Mixture Model (GMM) on four databases: YOHO, NIST, TI_digits1 and TI_digits2

  • We present the results for all four databases and compare our proposed scheme to both BPNN and the GMM-universal background model (GMM-UBM)

Introduction

The term Speaker Recognition [1] consists of Speaker Identification – the identification of the speaker speaking the current utterance – and Speaker Verification – the verification from the utterance of whether the speaker is who he claims to be. The current approach is aimed at Text-independent Speaker Identification. The analogue speech signal is first sampled, and each sample is represented by one or more bytes (e.g. one byte for a 256-level quantisation). The resulting digitised discrete-time signal contains the frequency content of the audio. It must be pre-processed to extract feature vectors that capture speaker-specific information regardless of the content of the speech itself. A learning algorithm generalises these feature vectors across speakers during training and determines the speaker identity of a test signal during the test phase.
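As a sketch of such a front end (assuming the librosa library, a 16 kHz sampling rate and a 25 ms analysis window; the function name frame_features and all parameter values are illustrative, not taken from the paper):

```python
import numpy as np
import librosa  # assumed dependency; any MFCC/LPC implementation would do

def frame_features(path, n_mfcc=13, lpc_order=12):
    """Digitise a recording, window it, and extract MFCC and LPC-based formants."""
    y, sr = librosa.load(path, sr=16000)  # discrete-time signal at 16 kHz
    # MFCC features over the whole signal (librosa windows it internally).
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
    # LPC analysis on a single 25 ms Hamming window.
    n = int(0.025 * sr)
    frame = y[:n] * np.hamming(n)
    a = librosa.lpc(frame, order=lpc_order)
    # Formant candidates are the frequencies of the LPC polynomial roots
    # lying in the upper half-plane.
    roots = [r for r in np.roots(a) if np.imag(r) > 0]
    formants_hz = sorted(np.angle(r) * sr / (2 * np.pi) for r in roots)
    return mfcc, formants_hz
```

Estimating formants from the angles of the LPC polynomial roots is a standard technique; the filtering step described in the abstract would then keep only those frequencies that fall inside known vowel formant ranges.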
