Overview of end-to-end speech recognition

Song Wang,Guanyu Li

doi:10.1088/1742-6596/1187/5/052068

Overview of end-to-end speech recognition

Song Wang, Guanyu Li

Open Access

https://doi.org/10.1088/1742-6596/1187/5/052068

Copy DOI

Journal: Journal of Physics: Conference Series	Publication Date: Apr 1, 2019
Citations: 13	License type: cc-by

Affiliation: Minzu University of China

#Speech Recognition #Speech Recognition Method + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

In the 1960s, automatic speech recognition has been widely studied. In the past, HMM has been the mainstream of the acoustic model. With the development of machine learning, neural network is introduced to the speech recognition, relying on neural network’s strong learning ability. Thus the acoustic model of DNN has significantly improved the voice recognition rate, compared to the HMM model. In order to simplify the traditional speech recognition system, the end-to-end speech recognition method is proposed. This paper mainly introduces and analyzes the end-to-end system, and the main two models of CTC and attention, as well as the prospect of future speech recognition research.

Full Text