Performance Comparison of Multiple Speech Features for Speaker Recognition using Artifical Neural Network

Mary Saniya Rozario,Dominic Mathew,Abraham Thomas

doi:10.1109/icacc48162.2019.8986182

Abstract

Human speech is the natural way of communication between humans. Speech signal contains among others, information that conveys the message being spoken by a person, speaker characteristics and language. It also carries information regarding the speaker’s identity. The objective of Speaker Recognition Systems (SRS) is to automatically identify the speaker using the features extracted from their speech signals. Currently, SRS is one of the most popular biometric technique. It is used for surveillance, forensic speaker recognition, authentication, and such similar activities. In this work, Artifical Neural Network (ANN) based text-independent speaker identification is done. We compare the performance of Relative Spectral Amplitude-Perceptual Linear Prediction (RASTA-PLP), Mel-Frequency Cepstral Coefficient (MFCC) and Power Normalized Cepstral Coefficient (PNCC) features. To improve the performance accuracy of the system we extract features from the voiced frames of the signal. It was observed that MFCC outperforms the other two features namely PNCC and RASTA-PLP. TIMIT database was used for the experiments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance Comparison of Multiple Speech Features for Speaker Recognition using Artifical Neural Network

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Robust Speaker Verification Using Improved PNCC Based on GMM-UBM
Xinxing Jing ... Bingwei Xiang
International Journal of Automation and Power Engineering | VOL. 4
Xinxing Jing, et. al.Xinxing Jing ... Bingwei Xiang
01 Jan 2015
International Journal of Automation and Power Engineering | VOL. 4

Speaker verification from codec distorted speech for forensic investigation through serial combination of classifiers
M.S Athulya ... P.S Sathidevi
Digital Investigation | VOL. 25
M.S Athulya, et. al.M.S Athulya ... P.S Sathidevi
31 Mar 2018
Digital Investigation | VOL. 25

Chapter 7 - Closed-set speaker identification system based on MFCC and PNCC features combination with different fusion strategies
Musab T.S Al-Kaltakchi ... Satnam S Dlay
Applied Speech Processing | VOL. -
Musab T.S Al-Kaltakchi, et. al.Musab T.S Al-Kaltakchi ... Satnam S Dlay
01 Jan 2020
Applied Speech Processing | VOL. -

PNCC for forensic automatic speaker recognition
Betty Kurian ... Leena Mary
-
Betty Kurian, et. al.Betty Kurian ... Leena Mary
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance Comparison of Multiple Speech Features for Speaker Recognition using Artifical Neural Network

Abstract

Talk to us

Similar Papers