Recurrent Poisson Process Unit for Speech Recognition

Hengguan Huang, Hao Wang, Brian Mak

doi:10.1609/aaai.v33i01.33016538

Abstract

Over the past few years, there has been a resurgence of interest in using recurrent neural network-hidden Markov models (RNN-HMMs) for automatic speech recognition (ASR). Some modern recurrent network models, such as long short-term memory (LSTM) and the simple recurrent unit (SRU), have demonstrated promising results on this task. Recently, several scientific perspectives in the fields of neuroethology and speech production suggest that human speech signals may be represented as discrete point patterns involving acoustic events in the speech signal. Under this hypothesis, RNN-HMM acoustic modeling faces some challenges: first, it arbitrarily discretizes the continuous input into interval features at a fixed frame rate, which may introduce discretization errors; second, the times at which such acoustic events occur are unknown. Furthermore, the training targets of an RNN-HMM are obtained from other (inferior) models, giving rise to misalignments. In this paper, we propose a recurrent Poisson process (RPP), which can be seen as a collection of Poisson processes over a series of time intervals, in which the intensity evolves according to the RNN hidden states that encode the history of the acoustic signal. It aims at allocating the latent acoustic events in continuous time. Such events are efficiently drawn from the RPP using a sampling-free solution in analytic form. The speech signal containing latent acoustic events is reconstructed/sampled dynamically from the discretized acoustic features using linear interpolation, in which the weight parameters are estimated from the onsets of these events. The above processes are further integrated into an SRU, forming our final model, called the recurrent Poisson process unit (RPPU). Experimental evaluations on ASR tasks including CHiME-2, WSJ0 and WSJ0&1 demonstrate the effectiveness and benefits of the RPPU. For example, it achieves a relative WER reduction of 10.7% over state-of-the-art models on WSJ0.
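The abstract does not give the RPP equations, so the following is only a rough sketch of the general idea, not the authors' formulation. It assumes a constant intensity within each frame interval, obtained from a hidden state through a softplus (a hypothetical choice), takes the "analytic, sampling-free" onset to be the mean first-arrival time of a homogeneous Poisson process conditioned on the event falling inside the interval, and uses that onset as the linear interpolation weight between two adjacent feature frames. All function names and parameters are illustrative.

```python
import math


def intensity_from_hidden(hidden, weights, bias):
    """Hypothetical mapping: positive intensity = softplus(w . h + b)."""
    z = sum(w * h for w, h in zip(weights, hidden)) + bias
    # Numerically stable softplus: log(1 + exp(z)).
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def expected_onset(rate, interval):
    """Mean first-arrival time of a homogeneous Poisson process with the
    given rate, conditioned on the arrival falling inside [0, interval].

    Closed form: E[t | t < T] = 1/rate - T / (exp(rate * T) - 1).
    """
    if rate <= 0 or interval <= 0:
        raise ValueError("rate and interval must be positive")
    x = rate * interval
    if x > 700.0:
        # exp(x) would overflow; the correction term is negligible here.
        return 1.0 / rate
    return 1.0 / rate - interval / math.expm1(x)


def interpolate_frames(prev_frame, next_frame, onset, interval):
    """Linear interpolation between two frames at the event onset."""
    w = onset / interval  # assumed weight: relative position in the interval
    return [(1.0 - w) * a + w * b for a, b in zip(prev_frame, next_frame)]


if __name__ == "__main__":
    frame_shift = 0.01  # assumed 10 ms frame shift, in seconds
    rate = intensity_from_hidden([0.2, -0.1], [1.5, 0.5], bias=4.0)
    onset = expected_onset(rate, frame_shift)
    print(interpolate_frames([0.0, 4.0], [4.0, 0.0], onset, frame_shift))
```

Under these assumptions, a higher intensity pulls the expected onset toward the start of the interval, while a very low intensity leaves it near the midpoint, so the interpolated feature moves smoothly between the two frames as the hidden state changes.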

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jul 17, 2019
Citations: 14

