Human Age Estimation Through Audio Utilising MFCC and RNN

Wenripin Chandra,Irpan Adiputra Pardosi,Osfredo Quinn,Ken Ken

doi:10.33395/sinkron.v8i3.12656

Wenripin Chandra, Irpan Adiputra Pardosi + Show 2 more

Open Access

https://doi.org/10.33395/sinkron.v8i3.12656

Copy DOI

Journal: SinkrOn	Publication Date: Jul 30, 2023
License type: CC BY-NC 4.0

Affiliation: Pelita Harapan University

Abstract

Age is one of human main attributes. Age is important factor to improve communication experience. Age estimation has been used in several applications to improve user experience. Therefore, an approach is needed to estimate the user age, one of which is through audio. In this study, Mel Frequency Cepstrum Coefficients (MFCC) and Recurrent Neural Network (RNN) will be used to estimate age through audio. MFCC is used to get features from audio data, while RNN is used to estimate age. Dataset used here was taken from corpus of user speech data on the Common Voice website. This study shows that MFCC and RNN methods are able to estimate human age through audio with highest accuracy obtained in SimpleRNN is 0.5647, and 0.7087 in LSTM.

Full Text