Speaker Age and Gender Estimation Based on Deep Learning Bidirectional Long-Short Term Memory (BiLSTM)

Aalaa Ahmed Mohammed, Yusra Faisal Al-Irhayim

doi:10.25130/tjps.v26i4.166

Open Access

https://doi.org/10.25130/tjps.v26i4.166

Journal: Tikrit Journal of Pure Science
Publication Date: Jul 10, 2021
License type: CC BY 4.0

Abstract

Estimating a speaker's age and gender has gained great importance in recent years because of its use in commercial, medical, and forensic applications. This work estimates the speaker's gender and age within narrow age ranges: each ten-year span is divided into two subcategories, covering ages from the teens to the sixties. The proposed speaker age and gender estimation system uses Mel Frequency Cepstral Coefficients (MFCC) for feature extraction and Bidirectional Long Short-Term Memory (BiLSTM) for classification. Two deep neural network models were built, one for speaker age estimation and the other for speaker gender estimation. The experimental results show that the age estimation model achieves an accuracy of 94.008%, while the gender estimation model achieves an accuracy of 90.816%.
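
The abstract names MFCC as the feature extraction step but gives no parameters. The following is a minimal sketch of how MFCCs can be computed for a single audio frame, using only the Python standard library. It is not the authors' implementation: the frame length (256 samples), sample rate (8000 Hz), number of mel filters (20), number of coefficients (13), the Hamming window, and all function names are assumptions chosen for illustration.

```python
import math

def hz_to_mel(f):
    # Common mel-scale formula (assumed variant; others exist).
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def power_spectrum(frame):
    """Naive DFT power spectrum of a Hamming-windowed frame (bins 0..N/2)."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        spec.append((re * re + im * im) / n)
    return spec

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose edges are equally spaced on the mel scale."""
    top = hz_to_mel(sample_rate / 2)
    mel_points = [top * i / (n_filters + 1) for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * mel_to_hz(m) / sample_rate))
            for m in mel_points]
    bank = []
    for j in range(1, n_filters + 1):
        lo, mid, hi = bins[j - 1], bins[j], bins[j + 1]
        row = [0.0] * (n_fft // 2 + 1)
        for k in range(lo, mid):      # rising edge of the triangle
            row[k] = (k - lo) / (mid - lo)
        for k in range(mid, hi):      # falling edge of the triangle
            row[k] = (hi - k) / (hi - mid)
        bank.append(row)
    return bank

def mfcc_frame(frame, sample_rate, n_filters=20, n_coeffs=13):
    """MFCCs of one frame: power spectrum -> mel energies -> log -> DCT-II."""
    spec = power_spectrum(frame)
    bank = mel_filterbank(n_filters, len(frame), sample_rate)
    log_e = [math.log(max(sum(w * p for w, p in zip(row, spec)), 1e-10))
             for row in bank]
    return [sum(e * math.cos(math.pi * n * (m + 0.5) / n_filters)
                for m, e in enumerate(log_e))
            for n in range(n_coeffs)]

# Synthetic 440 Hz tone standing in for one frame of speech.
frame = [math.sin(2 * math.pi * 440 * i / 8000) for i in range(256)]
coeffs = mfcc_frame(frame, 8000)
print(len(coeffs))  # 13
```

In a pipeline like the one the abstract describes, such per-frame vectors would be stacked over time into a sequence and passed to the BiLSTM classifier. A production system would normally rely on an FFT-based audio library rather than the naive DFT used here.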
