SPOKEN-DIGIT CLASSIFICATION USING ARTIFICIAL NEURAL NETWORK

Aunhel John M. Adoptante, Patrick Kendrex L. Lucero, Rhowel M. Dellosa, Arnie M. Baes, John Carlo A. Catilo, Anton Louise P. De Ocampo, Alvin S. Alon

doi:10.11113/aej.v13.18388

Abstract

Audio classification is one of the most popular applications of Artificial Neural Networks (ANNs). It is central to modern AI technology such as virtual assistants, automatic speech recognition, and text-to-speech applications. Spoken-digit classification and its applications have been studied before. However, to the best of the authors' knowledge, very few works on English spoken-digit recognition have used an ANN as the classifier. In this study, the authors used the Mel-Frequency Cepstral Coefficient (MFCC) features of each audio recording as input and an ANN as the classifier to recognize the digit spoken by the speaker. The Audio MNIST dataset was used as training and test data, while the Free-Spoken Digit Dataset was used as additional validation data. The model achieved an F1 score of 99.56% on the test data and 81.92% on the validation data.
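The abstract reports its results as F1 scores, so a short illustration of how such a score can be computed for a ten-digit classifier may help. The sketch below is only an assumption about the metric: the abstract does not state which averaging scheme was used, so macro averaging (the unweighted mean of per-class F1) is chosen here for illustration. The function name `macro_f1` and the toy labels are hypothetical and are not taken from the paper.

```python
from collections import Counter


def macro_f1(y_true, y_pred, labels):
    """Unweighted mean of per-class F1 scores (macro averaging, an assumption)."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1      # correct prediction for class t
        else:
            fp[p] += 1      # class p was predicted but wrong
            fn[t] += 1      # true class t was missed
    scores = []
    for c in labels:
        denom = 2 * tp[c] + fp[c] + fn[c]
        # F1 = 2TP / (2TP + FP + FN); taken as 0 when the class never occurs
        scores.append(2 * tp[c] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy labels for three digit classes (hypothetical, not results from the paper)
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
score = macro_f1(y_true, y_pred, labels=[0, 1, 2])
print(f"macro F1 = {score:.4f}")
```

Because F1 combines precision and recall, it is a different quantity from accuracy, although the two often sit close together on a balanced dataset.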


Journal: ASEAN Engineering Journal
Publication Date: Feb 28, 2023
Citations: 1

