Development of a Low-Latency and Real-Time Automatic Speech Recognition System

Chee Siang Leow, Hiromitsu Nishizaki, Tomoaki Hayakawa, Norihide Kitaoka

doi:10.1109/gcce50665.2020.9291818

Publication Date: Oct 13, 2020

Citations: 4

Affiliation: University of Yamanashi, Toyohashi University of Technology

#Automatic Speech Recognition System #Automatic Speech Recognition #Corpus Of Spontaneous Japanese #Real-Time Speech Recognition System #Real-Time Automatic Speech Recognition #Automatic Speech Recognition Accuracy #Real-Time Speech Recognition #Voice Activity Detection #Real-Time Speech #Audio Transmitter

Abstract

In this study, a real-time automatic speech recognition (ASR) system based on the Kaldi ASR toolkit was developed. The system offers low latency and customizable models and requires no internet connection. It comprises a speech-processing front end, consisting of a voice activity detection (VAD) module and an audio transmitter, and a decoder for the received audio signals. The system was evaluated in terms of ASR accuracy and speech processing speed. In this evaluation, it achieved high ASR accuracy on the CSJ (Corpus of Spontaneous Japanese) test set with very low latency.
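The abstract does not say how the VAD module decides which audio to pass to the transmitter. As an illustration only, the following is a minimal sketch of a generic energy-based VAD, which is a common baseline and not necessarily the method used in the paper. The function names, the frame length (10 ms at an assumed 16 kHz sampling rate), the energy threshold, and the hangover count are all hypothetical choices made for this example.

```python
import math
from typing import List, Tuple


def frame_energy_db(frame: List[float]) -> float:
    """Return the mean-square energy of one frame in decibels."""
    if not frame:
        return -math.inf
    power = sum(s * s for s in frame) / len(frame)
    return 10.0 * math.log10(power) if power > 0 else -math.inf


def detect_speech_segments(
    samples: List[float],
    frame_len: int = 160,         # hypothetical: 10 ms at 16 kHz
    threshold_db: float = -35.0,  # hypothetical energy threshold
    hangover_frames: int = 3,     # hypothetical: silent frames tolerated inside speech
) -> List[Tuple[int, int]]:
    """Return (start_sample, end_sample) spans judged to contain speech.

    A frame counts as speech when its energy exceeds threshold_db.
    A segment is closed once more than hangover_frames consecutive
    silent frames follow it; the trailing silence is not included.
    """
    segments: List[Tuple[int, int]] = []
    start = None      # sample index where the open segment began
    silent_run = 0    # consecutive silent frames seen inside an open segment
    n_frames = len(samples) // frame_len

    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        if frame_energy_db(frame) > threshold_db:
            if start is None:
                start = i * frame_len
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run > hangover_frames:
                # Close the segment at the end of the last speech frame.
                segments.append((start, (i - silent_run + 1) * frame_len))
                start = None
                silent_run = 0

    # Close a segment that is still open when the audio ends.
    if start is not None:
        segments.append((start, (n_frames - silent_run) * frame_len))
    return segments


# Usage: 0.1 s of silence, 0.1 s of a constant-amplitude signal, 0.1 s of silence.
audio = [0.0] * 1600 + [0.5] * 1600 + [0.0] * 1600
segments = detect_speech_segments(audio)
```

In a pipeline like the one the abstract describes, only the samples inside the returned spans would be handed to the audio transmitter, so the decoder does not spend time on silence. A production front end would process frames as they arrive rather than a complete buffer.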
