Transfer learning for children's speech recognition

Rong Tong, Lei Wang, Bin Ma

doi:10.1109/ialp.2017.8300540

Abstract

Children's speech processing is more challenging than adults' speech processing because large-scale children's speech corpora are lacking. As children's speech organs are still developing, high inter-speaker and intra-speaker variability is observed in their speech. In addition, collecting data from children is difficult, as children usually have short attention spans and limited language proficiency. In this paper, we propose to improve automatic speech recognition for children with transfer learning. We compare two transfer learning approaches that use adults' data to enhance children's speech recognition performance. The first performs acoustic model adaptation on a pre-trained adult model. The second trains the acoustic model with a deep neural network based multi-task learning approach: the adults' and children's acoustic characteristics are learnt jointly in the shared hidden layers, while the output layers are optimized separately for the different speaker groups. Our experimental results show that both transfer learning approaches are effective in transferring rich phonetic and acoustic information from the adults' model to the children's model. The multi-task learning approach outperforms the acoustic adaptation approach. We further show that speakers' acoustic characteristics in other languages can also benefit the target language under the multi-task learning framework.
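
The multi-task arrangement described in the abstract (shared hidden layers, one output layer per speaker group) can be sketched roughly as below. This is a minimal illustration only, not the authors' implementation: the layer sizes, the tanh activation, the number of output classes, the equal weighting of the two losses, and the random stand-in data are all assumptions made for the example, and only the forward pass and loss are shown.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions, chosen only for illustration.
FEAT_DIM, HIDDEN_DIM = 40, 64
N_CLASSES = {"adult": 30, "child": 30}


def init_layer(n_in, n_out):
    """Return a (weights, bias) pair with small random weights."""
    return rng.normal(0.0, 0.1, (n_in, n_out)), np.zeros(n_out)


# Hidden layers shared by both speaker groups.
shared = [init_layer(FEAT_DIM, HIDDEN_DIM), init_layer(HIDDEN_DIM, HIDDEN_DIM)]
# One group-specific output layer ("head") per speaker group.
heads = {g: init_layer(HIDDEN_DIM, n) for g, n in N_CLASSES.items()}


def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def forward(frames, group):
    """Pass frames through the shared layers, then the head for `group`."""
    h = frames
    for W, b in shared:
        h = np.tanh(h @ W + b)
    W, b = heads[group]
    return softmax(h @ W + b)


def cross_entropy(posteriors, labels):
    picked = posteriors[np.arange(len(labels)), labels]
    return float(-np.mean(np.log(picked)))


# Random stand-in data: 8 feature frames and labels per speaker group.
batch = {
    g: (rng.normal(size=(8, FEAT_DIM)), rng.integers(0, n, size=8))
    for g, n in N_CLASSES.items()
}

# Joint objective: each group's loss uses its own head, but both losses
# depend on the same shared hidden layers.
losses = {g: cross_entropy(forward(x, g), y) for g, (x, y) in batch.items()}
total_loss = losses["adult"] + losses["child"]
```

In a full training loop, gradients of `total_loss` would update the shared layers from both groups' data, while each head would be updated only by its own group's loss. That is the mechanism by which adult data can inform the children's model in this kind of setup.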

Similar Papers

Analyzing pitch robustness of PMVDR and MFCC features for children's speech recognition
Shweta Ghai ... Rohit Sinha
01 Jul 2010

Does visual speech provide release from perceptual masking in children?
Destinee M Halverson ... Kaylah Lalonde
The Journal of the Acoustical Society of America | VOL. 148
01 Sep 2020

Effect of pitch enhancement in Punjabi children's speech recognition system under disparate acoustic conditions
Vivek Bhardwaj ... Vinay Kukreja
Applied Acoustics | VOL. 177
26 Jan 2021

Fuzzy-based discriminative feature representation for children's speech recognition
Seyed Mostafa Mirhassani ... Hua-Nong Ting
Digital Signal Processing | VOL. 31
09 May 2014
