Abstract

This Letter proposes a deep neural network (DNN) adaptation method, herein referred to as the hidden variability subspace (HVS) method, to improve robustness under diverse acoustic environments arising from differences in conditions, e.g. speaker, channel, duration and environmental noise. In the proposed approach, a set of condition-dependent parameters is estimated in the HVS and used to adapt the hidden layer weights of the DNN, thereby reducing the condition mismatch. These condition-dependent parameters are connected to various layers through a new set of adaptively trained weights. The authors evaluate the proposed hidden variability learning method on a language identification task and show that significant performance gains can be obtained by discriminatively estimating a set of adaptation parameters to compensate for the mismatch in the trained model.
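
As a rough illustration of the idea of connecting condition-dependent parameters to a hidden layer through an extra set of weights, the sketch below computes h = sigmoid(W x + V c + b). This is only one plausible reading of the abstract, not the authors' formulation: the additive form, the names `W`, `V`, `b`, `c` and `adapted_hidden_layer`, the layer sizes and the random values are all assumptions made for illustration, and no training of the adaptation parameters is shown.

```python
import math
import random


def sigmoid(z):
    """Logistic activation."""
    return 1.0 / (1.0 + math.exp(-z))


def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in M]


def adapted_hidden_layer(x, c, W, V, b):
    """Hypothetical adapted layer: h = sigmoid(W x + V c + b).

    x : input features
    c : condition-dependent vector (assumed low-dimensional)
    W : ordinary hidden-layer weights
    V : extra weights connecting c to the hidden layer (assumed form)
    b : hidden-layer bias
    """
    wx = matvec(W, x)
    vc = matvec(V, c)
    return [sigmoid(p + q + r) for p, q, r in zip(wx, vc, b)]


# Toy dimensions and random values, chosen only for illustration.
random.seed(0)
n_in, n_hidden, n_cond = 8, 4, 3
W = [[random.gauss(0, 0.1) for _ in range(n_in)] for _ in range(n_hidden)]
V = [[random.gauss(0, 0.1) for _ in range(n_cond)] for _ in range(n_hidden)]
b = [0.0] * n_hidden
x = [random.gauss(0, 1) for _ in range(n_in)]
c = [random.gauss(0, 1) for _ in range(n_cond)]

# Activations with the condition vector, and with it zeroed out
# (the latter reduces to the unadapted layer sigmoid(W x + b)).
h_adapted = adapted_hidden_layer(x, c, W, V, b)
h_baseline = adapted_hidden_layer(x, [0.0] * n_cond, W, V, b)
```

In this sketch, setting `c` to zero recovers the unadapted layer, so the condition vector acts as a per-condition shift of the hidden pre-activations. How the Letter actually estimates these parameters and which layers they feed is not specified in the abstract.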
