Mel Frequency Cepstral Coefficients (MFCC) Method and Multiple Adaline Neural Network Model for Speaker Identification

Sudi Mariyanto Al Sasongko, Suthami Ariessaputra, Shofian Tsaury, Syafaruddin Ch

doi:10.30630/joiv.7.4.1376

Open Access

https://doi.org/10.30630/joiv.7.4.1376

Abstract

Speech recognition technology makes human interaction with computers more accessible. The speaker recognition process has two phases: extracting voice features, and identifying the speaker's voice pattern from the voice characteristics of each speaker. The speakers include both men and women, and their voices are recorded and stored in a computer database. Mel Frequency Cepstrum Coefficients (MFCC) are used at the feature extraction stage, with 13 characteristic coefficients. MFCC is based on the way the human ear's critical bandwidth varies with frequency (linear and logarithmic). Each sound frame is converted to the Mel frequency scale and processed with several triangular filters to obtain the cepstrum coefficients. At the speech pattern recognition stage, the system uses an artificial neural network (ANN) Madaline model (Many Adaline, that is, multiple Adaline units) to compare the characteristics of the test voice with the features of the training voices, which have been entered as training data. The Madaline neural network is trained with BFGS Quasi-Newton backpropagation and a goal parameter of 0.0001. The results of the study indicate that the Madaline model of artificial neural networks is not recommended for identification research. The recognition rate for speech inside the database reached 61% over ten tests. Of the test voices from outside the database, only 14% were rejected, while 84% were rejected when the out-of-database test used words different from the training data. The results of this model can be used as a reference for creating an Android-based real-time system.
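The feature extraction stage described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: only the 13 coefficients and the use of triangular Mel filters come from the abstract. The 16 kHz sample rate, 25 ms frames with a 10 ms hop, 512-point FFT, 26 filters, Hamming window, and the common `2595 * log10(1 + f / 700)` Mel formula are all assumptions chosen for the example, and the function names are hypothetical.

```python
import numpy as np

def hz_to_mel(f):
    # Common Mel-scale convention (an assumption; the abstract gives no formula).
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters whose edges are equally spaced on the Mel scale.
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                             n_filters + 2)
    hz_points = mel_to_hz(mel_points)
    fft_freqs = np.linspace(0.0, sample_rate / 2.0, n_fft // 2 + 1)
    bank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(n_filters):
        left, center, right = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        bank[i] = np.maximum(0.0, np.minimum(rising, falling))
    return bank

def mfcc(signal, sample_rate, n_coeffs=13, frame_len=400, hop=160,
         n_fft=512, n_filters=26):
    # Frame sizes and filter count are illustrative assumptions.
    signal = np.asarray(signal, dtype=float)
    if len(signal) < frame_len:
        signal = np.pad(signal, (0, frame_len - len(signal)))
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(frame_len)

    # Power spectrum of each windowed frame (zero-padded to n_fft).
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft

    # Mel filterbank energies, then log compression.
    energies = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    log_energies = np.log(energies + 1e-10)

    # Unnormalized DCT-II of the log energies; keep the first n_coeffs terms.
    k = np.arange(n_coeffs)[:, None]
    n = np.arange(n_filters)[None, :]
    dct_basis = np.cos(np.pi * k * (2 * n + 1) / (2 * n_filters))
    return log_energies @ dct_basis.T

# Usage: one second of a 440 Hz tone stands in for a recorded voice.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440.0 * t)
features = mfcc(tone, sample_rate)  # one row of 13 coefficients per frame
```

Each row of `features` would then serve as an input vector to the classifier; the Madaline network and its BFGS Quasi-Newton training are not reproduced here.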

Journal: JOIV : International Journal on Informatics Visualization
Publication Date: Dec 3, 2023
License type: CC BY-SA 4.0
