Identification of Noisy Utterance Speech Signal using GA-Based Optimized 2D-MFCC Method and a Bispectrum Analysis

Benyamin Kusumoputro, Li Na, Agus Buono

doi:10.4236/jsea.2012.512b037

Abstract

One-dimensional Mel-Frequency Cepstrum Coefficients (1D-MFCC) combined with power spectrum analysis are commonly used for feature extraction in speaker identification systems. However, because this one-dimensional feature extraction subsystem shows a low recognition rate when identifying an utterance speech signal under harsh noise conditions, we have developed a speaker identification system based on two-dimensional Bispectrum data, which is theoretically more robust to additive Gaussian noise. As the processing sequence of the 1D-MFCC method cannot be applied directly to two-dimensional Bispectrum data, in this paper we propose a 2D-MFCC method as an extension of the 1D-MFCC method, with the 2D filter design optimized using Genetic Algorithms (GA). With the 2D-MFCC method and Bispectrum analysis as the feature extraction technique, we use a Hidden Markov Model as the pattern classifier. We experimentally evaluate the developed methods for identifying utterance speech signals buried in various levels of noise. The experimental results show that, for utterance signals without added noise, the 2D-MFCC method without GA optimization achieves a recognition rate comparable to that of the 1D-MFCC method. However, when the utterance signal is buried in Gaussian noise, the developed 2D-MFCC method shows higher recognition capability, especially when the 2D-MFCC optimized by Genetic Algorithms is used.
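
The robustness argument rests on a standard property of higher-order statistics: the bispectrum is the Fourier transform of the third-order cumulant, which vanishes for a Gaussian process, so additive zero-mean Gaussian noise contributes nothing to it in expectation. The Python sketch below illustrates a direct (FFT-based) bispectrum estimate, B(f1, f2) = E[X(f1) X(f2) X*(f1 + f2)], averaged over frames. It is not the authors' implementation; the function name `bispectrum`, the frame length, the hop size, and the Hann window are all assumptions chosen for illustration.

```python
import numpy as np


def bispectrum(signal, frame_len=64, hop=32):
    """Direct bispectrum estimate averaged over overlapping frames.

    frame_len, hop, and the Hann window are illustrative assumptions,
    not values taken from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    window = np.hanning(frame_len)

    # idx_sum[i, j] = (i + j) mod frame_len, i.e. the FFT bin of f1 + f2.
    k = np.arange(frame_len)
    idx_sum = (k[:, None] + k[None, :]) % frame_len

    acc = np.zeros((frame_len, frame_len), dtype=complex)
    n_frames = 0
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        frame = (frame - frame.mean()) * window  # remove DC, then taper
        X = np.fft.fft(frame)
        # Triple product X(f1) * X(f2) * conj(X(f1 + f2)) for all bin pairs.
        acc += X[:, None] * X[None, :] * np.conj(X[idx_sum])
        n_frames += 1

    if n_frames == 0:
        raise ValueError("signal is shorter than one frame")
    return acc / n_frames
```

The result is a two-dimensional complex array indexed by (f1, f2), which is why a one-dimensional mel filter bank cannot be applied to it directly and a two-dimensional filter design is needed.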

Highlights

Research on automatic speech and voice identification systems has attracted much interest in the last few years, motivated by the growth of applications in many areas, such as diagnosis of a rotor crack [1], classification of unknown radar targets [2], medical disease diagnosis [3], and personal and gender identification for security systems [4,5].
As the processing sequence of the 1D Mel-Frequency Cepstrum Coefficients (1D-MFCC) method cannot be applied directly to two-dimensional Bispectrum data, in this paper we propose a 2D-MFCC method as an extension of the 1D-MFCC method, with the 2D filter design optimized using Genetic Algorithms.
We have developed the 2D-MFCC feature extraction method for processing the Bispectrum data from utterance speech signals.

Summary

Introduction

Research on automatic speech and voice identification systems has attracted much interest in the last few years, motivated by the growth of applications in many areas, such as diagnosis of a rotor crack [1], classification of unknown radar targets [2], medical disease diagnosis [3], and personal and gender identification for security systems [4,5]. Speaker-based personal identification is the process of determining which registered speaker produced a given utterance speech signal. In this machine-based speech identification, a gallery of speech samples is first enrolled in the system and coded for subsequent searching. When an unidentified speech sample is presented to the system, it is compared thoroughly with each coded sample in the gallery, and identification is accomplished when a suitable match occurs. The main function of the feature extraction subsystem is to transform the input utterance speech signal into a set of features, while the classifier subsystem has to identify and classify the speaker by comparing the features extracted from his or her input speech signal with those from a database of known speakers.
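
The enroll-then-match flow described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the system described in the paper: `extract_features` is a hypothetical placeholder (an averaged log-magnitude spectrum) standing in for the 2D-MFCC features, and nearest-template matching by Euclidean distance stands in for the Hidden Markov Model classifier. The class name `SpeakerGallery` and all parameters are likewise invented for the example.

```python
import numpy as np


def extract_features(signal, frame_len=64):
    """Placeholder feature extractor (NOT the paper's 2D-MFCC).

    Returns the log-magnitude spectrum averaged over non-overlapping frames.
    """
    signal = np.asarray(signal, dtype=float)
    n_frames = len(signal) // frame_len
    if n_frames == 0:
        raise ValueError("signal is shorter than one frame")
    frames = signal[:n_frames * frame_len].reshape(n_frames, frame_len)
    spectra = np.abs(np.fft.rfft(frames * np.hanning(frame_len), axis=1))
    return np.log1p(spectra).mean(axis=0)


class SpeakerGallery:
    """Gallery of enrolled speakers, each coded as one feature template."""

    def __init__(self):
        self._templates = {}

    def enroll(self, speaker_id, signal):
        # Code the enrolled speech sample for subsequent searching.
        self._templates[speaker_id] = extract_features(signal)

    def identify(self, signal):
        # Compare the query with every coded sample; return the closest.
        # (Assumption: Euclidean nearest template instead of an HMM.)
        if not self._templates:
            raise ValueError("no speakers enrolled")
        query = extract_features(signal)
        distances = {
            speaker_id: float(np.linalg.norm(query - template))
            for speaker_id, template in self._templates.items()
        }
        return min(distances, key=distances.get)
```

A typical use would enroll one recording per speaker with `enroll` and then call `identify` on an unlabeled recording. Keeping feature extraction and classification as separate steps mirrors the two subsystems named in the text, so either could be replaced without touching the other.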

Journal: Journal of Software Engineering and Applications	Publication Date: Jan 1, 2012
Citations: 5	License type: CC BY 4.0
