An End-to-End Speech Recognition System Based on Shared Encoder

Zhengchang Wen, Yi Ding, Yingwei Liang, Qingyao Wu, Hailiang Huang, Xin Cheng

doi:10.1109/icebe55470.2022.00016

Abstract

With the growth of streaming media, automatic speech recognition (ASR) has been widely adopted in online education, live broadcasting, and other fields. Achieving good recognition quality in real scenarios, however, requires combining several technologies, such as front-end voice endpoint detection and a back-end language model. Filtering sensitive words in real scenarios additionally requires good online recognition and decoding methods. This paper presents an end-to-end speech recognition system that unifies streaming and non-streaming recognition on a shared encoder and adds a CTC structure at an intermediate layer. Exploiting the monosyllabic nature of Mandarin, we compute the probability distribution over syllables at that intermediate layer. The results show that the method is reliable for recognition in educational scenarios, with good results on AISHELL-1 and on real-scenario audio provided by the company. The system also provides accurate syllable information for further analysis of sensitive words.
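As an illustration of how per-frame syllable distributions from an intermediate CTC branch could be turned into a syllable sequence and checked against a sensitive-word list, here is a minimal sketch. It is not the authors' implementation: the greedy (best-path) decoding rule, the blank index, the toy syllable inventory, the probabilities, and the function names `ctc_greedy_decode` and `find_sensitive` are all assumptions made for this example.

```python
from typing import List, Sequence, Tuple

BLANK = 0  # assumption: index 0 is the CTC blank symbol


def ctc_greedy_decode(frame_probs: Sequence[Sequence[float]],
                      vocab: Sequence[str]) -> List[str]:
    """Best-path CTC decoding: take the argmax per frame,
    collapse consecutive repeats, then drop blanks."""
    best = [max(range(len(p)), key=p.__getitem__) for p in frame_probs]
    decoded: List[str] = []
    prev = None
    for idx in best:
        if idx != prev and idx != BLANK:
            decoded.append(vocab[idx])
        prev = idx
    return decoded


def find_sensitive(syllables: Sequence[str],
                   patterns: Sequence[Sequence[str]]) -> List[Tuple[int, Tuple[str, ...]]]:
    """Return (start_index, pattern) for every exact occurrence of a
    sensitive syllable pattern in the decoded syllable sequence."""
    hits = []
    for pat in patterns:
        n = len(pat)
        for i in range(len(syllables) - n + 1):
            if tuple(syllables[i:i + n]) == tuple(pat):
                hits.append((i, tuple(pat)))
    return hits


# Hypothetical syllable inventory (pinyin with tone digits) and toy posteriors.
vocab = ["<blank>", "ni3", "hao3", "ma5"]
frame_probs = [
    [0.10, 0.80, 0.05, 0.05],  # ni3
    [0.10, 0.70, 0.10, 0.10],  # ni3 (repeat, collapsed)
    [0.90, 0.05, 0.03, 0.02],  # blank
    [0.10, 0.10, 0.70, 0.10],  # hao3
    [0.20, 0.10, 0.60, 0.10],  # hao3 (repeat, collapsed)
    [0.80, 0.10, 0.05, 0.05],  # blank
]

syllables = ctc_greedy_decode(frame_probs, vocab)
hits = find_sensitive(syllables, [("ni3", "hao3")])
print(syllables, hits)
```

Matching on syllables rather than characters means homophones written with different characters map to the same pattern, which is one plausible reason syllable-level output is useful for sensitive-word analysis.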
