Age Estimation in Short Speech Utterances Based on LSTM Recurrent Neural Networks

Ruben Zazo, Joaquin Gonzalez-Rodriguez, Najim Dehak, Phani Sankar Nidadavolu, Nanxin Chen

doi:10.1109/access.2018.2816163

Open Access

https://doi.org/10.1109/access.2018.2816163

Journal: IEEE Access
Publication Date: Jan 1, 2018
Citations: 98
License type: CC BY-NC-ND

Affiliation: Johns Hopkins University

Abstract

Age estimation from speech has recently received increased interest because it is useful for many applications, such as user profiling, targeted marketing, and personalized call routing. Applications of this kind need to estimate the speaker's age quickly and can benefit greatly from real-time capabilities. Long short-term memory (LSTM) recurrent neural networks (RNNs) have been shown to outperform state-of-the-art approaches in related speech-based tasks, such as language identification and voice activity detection, especially when an accurate real-time response is required. In this paper, we propose a novel age estimation system based on LSTM-RNNs. The system handles short utterances (from 3 to 10 s) and can be easily deployed in a real-time architecture. The proposed system has been tested and compared with a state-of-the-art i-vector approach using data from the NIST Speaker Recognition Evaluation 2008 and 2010 data sets. Experiments on short-duration utterances show a relative improvement of up to 28% in mean absolute error for the new approach over the baseline system.
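
The abstract reports its result as a relative improvement in mean absolute error (MAE). As a minimal sketch of how such a figure is typically computed, the Python snippet below defines MAE and relative improvement for two sets of age predictions. The function names and all age values are hypothetical, chosen only to make the arithmetic easy to follow; they are not taken from the paper or from the NIST data.

```python
from typing import Sequence


def mean_absolute_error(true_ages: Sequence[float],
                        predicted_ages: Sequence[float]) -> float:
    """Average absolute difference between true and predicted ages, in years."""
    if len(true_ages) != len(predicted_ages) or len(true_ages) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs(t - p) for t, p in zip(true_ages, predicted_ages))
    return total / len(true_ages)


def relative_improvement(baseline_mae: float, new_mae: float) -> float:
    """Fraction by which the new system reduces the baseline MAE."""
    if baseline_mae <= 0:
        raise ValueError("baseline MAE must be positive")
    return (baseline_mae - new_mae) / baseline_mae


# Hypothetical speaker ages and predictions (illustration only, not paper data).
true_ages = [25.0, 40.0, 60.0, 33.0]
baseline_pred = [35.0, 30.0, 50.0, 43.0]  # each prediction is off by 10 years
new_pred = [33.0, 32.0, 52.0, 41.0]       # each prediction is off by 8 years

baseline_mae = mean_absolute_error(true_ages, baseline_pred)
new_mae = mean_absolute_error(true_ages, new_pred)
improvement = relative_improvement(baseline_mae, new_mae)

print(f"baseline MAE: {baseline_mae:.2f} years")
print(f"new MAE:      {new_mae:.2f} years")
print(f"relative improvement: {improvement:.1%}")
```

With these made-up numbers the baseline MAE is 10 years and the new MAE is 8 years, so the relative improvement is 20%. Under the same definition, the abstract's figure of up to 28% would mean the LSTM system's MAE is up to 28% lower than the i-vector baseline's on short utterances.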

Similar Papers

Kazakh and Russian Languages Identification Using Long Short-Term Memory Recurrent Neural Networks
Zhanibek Kozhirbayev ... Zhandos Yessenbayev
01 Sep 2017

Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks.
Ruben Zazo ... Joaquin Gonzalez-Rodriguez
PLOS ONE | VOL. 11
29 Jan 2016

Long short-term memory (LSTM) recurrent neural network for low-flow hydrological time series forecasting
Bibhuti Bhusan Sahoo ... Deepak Kumar
Acta Geophysica | VOL. 67
20 Jul 2019

Long short-term memory recurrent neural network architectures for large scale acoustic modeling
Haşim Sak ... Andrew Senior
14 Sep 2014
